Skip to content

Repository of my group final project that presented at the end of Digital Talent Scholarship 2021 training.

Notifications You must be signed in to change notification settings

azizamir/DTS-python-2021-final-project

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DTS-Python-2021-Final-Project

Overview

This project is my group's final project that was presented at the end of Digital Talent Scholarship 2021 training. This project is about web scrapping where our group is extracting Indonesian movie data from the IMDB website, analyzed, and visualized it using several python libraries to obtain an insight into the Indonesian movies industry.

Tools

  • Python version: 3.6.9
  • Packages and library: Scrapy, pandas, Matplotlib, Seaborn

Web Scraping

The web scraping method aims to extracting Indonesian movie data, specifically title, year, rating, director, and stars from IMDB website using Scrapy library that exported as CSV file format.

Exploratory Data Analysis

After the data was downloaded, we analyzed and visualized the data using pandas, Matplotlib, and seaborn to get an insight from the dataset, such as top 10 movies with the highest rating, top 10 most productive movie stars, top 5 most productive directors, movies created by the most productive director, movies rating data distribution from 1900's - 2000's, and movies rating vs year.

About

Repository of my group final project that presented at the end of Digital Talent Scholarship 2021 training.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.8%
  • Python 0.2%