Skip to content

Latest commit

 

History

History
32 lines (24 loc) · 1.44 KB

assignment-7.md

File metadata and controls

32 lines (24 loc) · 1.44 KB
layout title
page
Assignment 7 (30 points) - Due on Friday December 2nd

You can work as a two-member team for this assignment

For this assignment, you are to develop solutions for the following challenges using a Cypress Spark notebook. Each application can be developed as a series of Jupyter cells. Make sure that you properly separate different applications via Markdown cells.

For all applications, demonstrate the final results by displaying the first 20 output records

  1. Write a WordCount application so that punctuations and capitalization are no longer factors in determining unique words.

  2. Write an application to calculate the average rating of all movies over time and filter out any movie with averaged ratings less than 3.75 or rated less than 5000 times.

  3. From the output movies of challenge 2, count the genre distribution and place these counts in descending order.

Submission instructions:

Submission for this assignment is a Cypress Spark notebook containing:

  • Coding cells for each application with all output displayed
  • If there is any README information, it should be included in a Markdown cell at the top of the notebook.
  • If working as a team, a final Markdown cell that describes how the work is divided between the members of the team with justification (e.g., what each member knows and does not know about the assignment) must be added to the end of the notebook.

For each group, only one submission is needed.