This repository contains my data visualization project for worldwide movie series using Python

seuwenfei/Data-visualization-for-worldwide-movie-series

Data Visualization for Worldwide Movie Series

Introduction

With the growing popularity of YouTube, Amazon, Netflix, and other video streaming sites, most people have probably heard the term "web series". A web series is like a TV show, except that each episode (or "webisode") is released online rather than broadcast on television. The objective of this notebook is to represent the data graphically, highlighting patterns and trends and helping readers gain quick insights. The dataset comes from Kaggle and contains more than 3,000 series scraped from 100 web pages.

The dataset consists of 9 variables:

  • Unnamed: 0: row index number
  • Name: name of the series
  • Year: year(s) in which the series was broadcast
  • Tv Certificate: TV certificate (content rating) assigned to the series
  • Duration per episode: length of each episode
  • Genre: theme category of the series
  • Ratings: series rating on a scale of 1 to 10
  • Actor/Actress: cast members
  • Votes: number of people who voted for the series
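
As a rough illustration of how these columns might be prepared for plotting, the sketch below drops the index column and converts the duration and vote columns to numbers. The raw formats (`"57 min"`, `"1,200"`), the helper name `clean_series`, and the sample rows are all assumptions made up for this example; they are not taken from the Kaggle file or the notebook.

```python
import pandas as pd


def clean_series(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy with numeric duration, votes and ratings (hypothetical helper)."""
    # The "Unnamed: 0" column only repeats the row index, so it can be dropped.
    out = df.drop(columns=["Unnamed: 0"], errors="ignore").copy()

    # Assumed raw format: "57 min" -> 57
    out["Duration per episode"] = pd.to_numeric(
        out["Duration per episode"].astype(str).str.extract(r"(\d+)", expand=False),
        errors="coerce",
    )

    # Assumed raw format: "1,200" -> 1200
    out["Votes"] = pd.to_numeric(
        out["Votes"].astype(str).str.replace(",", "", regex=False),
        errors="coerce",
    )

    out["Ratings"] = pd.to_numeric(out["Ratings"], errors="coerce")
    return out


# Made-up rows that only mimic the nine columns listed above.
sample = pd.DataFrame({
    "Unnamed: 0": [0, 1],
    "Name": ["Series A", "Series B"],
    "Year": ["(2011-2019)", "(2016- )"],
    "Tv Certificate": ["TV-MA", "TV-14"],
    "Duration per episode": ["57 min", "51 min"],
    "Genre": ["Crime, Drama, Mystery", "Action, Crime, Drama"],
    "Ratings": [8.5, 7.9],
    "Actor/Actress": ["Actor 1, Actor 2", "Actor 3, Actor 4"],
    "Votes": ["1,200", "850"],
})

cleaned = clean_series(sample)
print(cleaned.dtypes)
```

Using `errors="coerce"` turns any value that does not match the assumed format into `NaN` instead of raising, which makes unexpected entries easy to spot with `cleaned.isna().sum()`.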

Python Libraries

pandas, numpy, seaborn, matplotlib, wordcloud

[Image: combine_images]

Read the complete Data Visualization for Worldwide Movie Series project here.

Conclusion

Series with the following characteristics are more likely to receive high or very high ratings:

  • TV Certificate: TV-MA and TV-14
  • Duration per episode: 47.0 to 62.9 minutes
  • Year Span: 5.6 to 15.3 years
  • Genre: (Crime, Drama, Mystery), (Crime, Drama, Thriller), (Action, Crime, Drama) and (Thriller)
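
One way to check findings of this kind is to compute, for each group of a feature, the share of series whose rating clears some cutoff. The sketch below does this with `pandas.cut` and `groupby`. The cutoff of 8.0, the equal-width binning, the helper name `high_rating_share`, and the toy rows are assumptions for illustration only; the README does not state how the notebook defines "high" ratings or derives its ranges.

```python
import pandas as pd


def high_rating_share(df, feature, rating_col="Ratings", threshold=8.0, bins=None):
    """Share of series rated >= threshold within each group of `feature`.

    Hypothetical helper: `threshold` and `bins` are illustrative choices,
    not values taken from the original notebook.
    """
    # Numeric features are grouped into equal-width intervals when `bins` is given;
    # categorical features are grouped by their raw values.
    key = pd.cut(df[feature], bins=bins) if bins is not None else df[feature]
    is_high = df[rating_col] >= threshold
    # The mean of a boolean column is the fraction of True values.
    return is_high.groupby(key, observed=True).mean().sort_values(ascending=False)


# Made-up rows, only to show the mechanics.
toy = pd.DataFrame({
    "Tv Certificate": ["TV-MA", "TV-MA", "TV-14", "TV-PG", "TV-PG"],
    "Duration per episode": [55, 60, 50, 25, 30],
    "Ratings": [8.6, 8.2, 8.1, 6.9, 7.4],
})

by_cert = high_rating_share(toy, "Tv Certificate")
by_duration = high_rating_share(toy, "Duration per episode", bins=3)
print(by_cert)
print(by_duration)
```

The same helper could be pointed at the `Genre` column, treating each genre combination such as "Crime, Drama, Mystery" as a single category.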
