Assignments for this course are programming exercises, mostly related to data analysis, and must be completed in Python. Each assignment will be posted to this section approximately 2 weeks before it is due and is submitted through Quercus (the assignment files give more detail on this). Please refer to the Assignment Schedule for the release and submission dates.

  • Each assignment comes with a document containing the questions and other instructions, along with a notebook for Google Colab, an interactive programming environment that automatically includes most standard Python packages.

  • Note: Some assignments may appear short, but they often take longer than expected to complete, so please do not leave them until the last minute. The first assignment will be posted after the second lecture. You can find the dates under the ‘Schedule’ section of the course website.


Assignment 1 (Data Visualization)

Instructions

Due Jan 31st at 10:59am

Reflection 1

Instructions

Due Feb 2nd at 10:59am

Homework 2 (Linear Regression)

Instructions

Due Feb 14th at 10:59am

Reflection 2

Instructions

Due Feb 16th at 10:59am

Homework 3 (Simulation)

Instructions

Due Mar 11th at 10:59am

Homework 4 (NLP)

Add your group members to this sheet by 10:59am on Mar 14th. Worth 5 pts.

Add your presentation time choice to this sheet beginning at noon on Mar 22nd.

Instructions

Here are the Part I datasets: training | testing