CIS241 Intro to Data Science
  • Home
  • Course Schedule
  • Assignments
  • Course Policies
  • Ground Rules
  • JupyterHub
  1. Course Information
  2. 2  Course Schedule
  • 1  Homepage
  • Course Information
    • 2  Course Schedule
    • 3  Course Policies
    • 4  Ground Rules
    • 5  Criteria for Good Reports
  • Assignments
    • 6  Documentation Assignment
    • 7  Project Plan
    • 8  Presentation
    • 9  Final Project
  • Guides
    • 10  JupyterHub
    • 11  Python
    • 12  Pandas
    • 13  Altair
    • 14  Hypothesis Testing
    • 15  Modeling with Scikit-learn
    • 16  Debugging Your Code
  • Other Key Info
    • 17  How to Explain in CIS241
    • 18  R.A.D. C.A.T.
    • 19  Uploading a Jupyter Assignment
    • 20  Work Acquistion and Gender in the Museum of Modern Art
    • 21  Advanced Jupyter Setup
  1. Course Information
  2. 2  Course Schedule

2  Course Schedule

Date Topics & Slides Workshops Assignments Supplemental Reading
13 & 15 Jan. What Is Data? & Responsible Data Collection Jupyter Bruce Ch. 1
20 & 22 Jan. Python & Data Wrangling 0: Practice Python McKinney, Ch. 3, 5, and 8; Downey Ch. 7
27 & 29 Jan. Exploratory Data Analysis: Summary Statistics & Data Types 1: Movies 1 Documentation Assignment Due 29 Jan. Downey Ch. 2; McKinney Ch. 11
3 & 5 Feb. Exploratory Data Analysis: Visualization 1: Movies 2 Wilke Ch. 5 and UW Ch. 1, 2, 3, & 6
10 & 12 Feb. Hypothesis Testing: Comparison of Means 2: Sports 1 (Begin Test 1) Downey Ch. 11 and 13; Bruce Ch. 3
17 & 19 Feb. Hypothesis Testing: Correlation 2: Sports 2 Test 1 19 Feb. Downey Ch. 9
24 & 26 Feb. Supervised Learning: Regression 3: Business 1 Downey Ch. 10; McKinney Ch. 12.4; Bruce Ch. 4
3 & 5 Mar. Supervised Learning: Regression, cont. 3: Business 2 Bruce Ch. 6
17 & 19 Mar. Ethical Data Science; Project Meetings 3.5: Ethics Project Plan Due 19 Mar.
24 & 26 Mar. Supervised Learning: Classification 4: Health 1 Bruce Ch. 5
31 Mar. & 2 Apr. Supervised Learning: Classification, cont.; Project Meetings 4: Health 2 (Begin Test 2) Bruce Ch. 5 & 6
7 & 9 Apr. Unsupervised Learning: Clustering 5: Literature 1 Test 2 9 Apr. Bruce Ch. 7
14 & 16 Apr. Unsupervised Learning: Dimension Reduction 5: Literature 2 Video Presentations Due 16 Apr. Bruce Ch. 7
21 & 23 Apr. Discuss Projects & Videos, Sample Final Project
28 Apr. 2pm–3:50pm CIS Open House on Scholars Day! Final Project Due Fri. 1 May at 2pm

2.1 Textbooks:

There are no required textbooks for this course. All textbooks are free online or made available to you on Sakai.

  • McKinney, Python for Data Analysis
  • UW, Visualization Curriculum for Altair
  • Wilke, Fundamentals of Data Visualization
  • Downey, Elements of Data Science
  • Bruce, Bruce, and Gedeck, Practical Statistics for Data Scientists 2nd ed.

2.1.1 Note on the schedule

Keep in mind that some of this schedule could change throughout the semester. However, if anything changes I’ll update this page, and I’ll be sure to give you plenty of advance notice.

2.2 Software

All projects in this course will be scripted and analyzed using Python, an open source programming language and environment. Specifically, we will be using JupyterHub as our programming environment. No previous experience with Python, statistical software packages, or computer programming is required.

1  Homepage
3  Course Policies