SilverScreenAnalytics is a data analysis project that explores a movie dataset to uncover patterns and trends in the film industry. It analyzes variables such as budget, revenue, popularity, and vote averages, with insights on the most expensive and profitable movies, genre distribution, and relationships between key factors.

FaithMnisi/SilverScreenAnalytics

SilverScreenAnalytics 🎬

Overview

SilverScreenAnalytics analyzes a movie dataset to explore patterns, trends, and relationships in the data. The aim was to uncover meaningful insights through exploratory analysis of the movies in the dataset.

Dataset Description

The dataset contained the following columns:

  • budget: The production budget of the movie
  • genres: Genre(s) associated with the movie
  • homepage: Official homepage URL
  • id: Unique identifier for the movie
  • keywords: Key themes or tags associated with the movie
  • original_language: Language in which the movie was originally released
  • original_title: Original title of the movie
  • overview: Brief summary of the movie's plot
  • popularity: Popularity score based on various factors
  • production_companies: Companies involved in the movie's production
  • production_countries: Countries where the movie was produced
  • release_date: Official release date
  • revenue: Total revenue generated by the movie
  • runtime: Movie duration in minutes
  • spoken_languages: Languages spoken in the movie
  • status: Release status (e.g., Released, Post-Production)
  • tagline: Promotional tagline for the movie
  • title: Movie title
  • vote_average: Average audience rating
  • vote_count: Number of votes received

Methodology

Data Cleaning

  • Removed redundant rows and columns.
  • Handled missing data appropriately.
  • Changed data types for consistency and analysis.
  • Flattened JSON columns for easier exploration and analysis.
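
The cleaning steps above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes columns such as genres and keywords hold JSON-encoded lists of objects with a "name" field, that rows arrive as dictionaries (for example from csv.DictReader), and that a missing budget or revenue can be treated as 0. The function names flatten_json_column and clean_rows are hypothetical.

```python
import json


def flatten_json_column(raw):
    """Turn a JSON-encoded list of objects into a plain list of their "name" values.

    Assumption: cells look like '[{"id": 28, "name": "Action"}]'.
    Empty, missing, or unparseable cells become an empty list.
    """
    if not raw:
        return []
    try:
        items = json.loads(raw)
    except (TypeError, ValueError):
        return []
    if not isinstance(items, list):
        return []
    return [item["name"] for item in items
            if isinstance(item, dict) and "name" in item]


def clean_rows(rows, json_columns=("genres", "keywords")):
    """Drop duplicate ids, flatten JSON columns, and cast money columns to int."""
    seen_ids = set()
    cleaned = []
    for row in rows:
        # Remove redundant rows: keep only the first row for each id.
        if row["id"] in seen_ids:
            continue
        seen_ids.add(row["id"])

        new_row = dict(row)
        for col in json_columns:
            new_row[col] = flatten_json_column(row.get(col))

        # Change data types; a missing value is treated as 0 here (an assumption).
        new_row["budget"] = int(row.get("budget") or 0)
        new_row["revenue"] = int(row.get("revenue") or 0)
        cleaned.append(new_row)
    return cleaned
```

Treating a missing budget as 0 keeps the arithmetic simple, but it also makes such movies look free to produce, so a real analysis may prefer to exclude those rows instead.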

Exploratory Analysis

  1. Action Genre Exploration: Focused on movies within the action genre, analyzing patterns specific to this category.
  2. Budget and Profitability:
    • Identified the top 5 least and most expensive movies.
    • Analyzed the top 5 most profitable movies.
  3. Popularity and Ratings:
    • Conducted popularity analysis to identify trends.
    • Explored movies rated above 7 to identify quality films.
  4. Genre Frequency: Investigated the frequency of movies in each genre.
  5. Comparative Analysis: Compared profits, revenue, and budgets for the top 5 most expensive and most profitable movies.
  6. Relationships Between Variables: Explored correlations and relationships between popularity, vote average, and profit.
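
A few of the analyses listed above can be sketched with the standard library alone. This is only an illustration under stated assumptions: profit is taken to be revenue minus budget, "rated above 7" is read as vote_average > 7, genres are already flattened into lists of names, and the helper names (add_profit, top_n, genre_frequency, highly_rated, pearson) are hypothetical rather than taken from the project.

```python
from collections import Counter
from math import sqrt


def add_profit(movies):
    """Return copies of the movies with a profit field (revenue - budget)."""
    return [{**m, "profit": m["revenue"] - m["budget"]} for m in movies]


def top_n(movies, key, n=5, largest=True):
    """Return the n movies with the largest (or smallest) value for key."""
    return sorted(movies, key=lambda m: m[key], reverse=largest)[:n]


def genre_frequency(movies):
    """Count how many movies list each genre."""
    return Counter(genre for m in movies for genre in m["genres"])


def highly_rated(movies, threshold=7):
    """Keep movies whose vote_average is strictly above the threshold."""
    return [m for m in movies if m["vote_average"] > threshold]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)
```

With a cleaned list of movies, top_n(add_profit(movies), "profit") would give the five most profitable entries, top_n(movies, "budget", largest=False) the five least expensive, and pearson applied to the popularity and profit values a single correlation figure for that pair.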

Insights

The exploration surfaced trends in how budget and profitability relate to popularity and audience ratings, and in how movies are distributed across genres. These findings serve as a foundation for further analysis and hypothesis generation.
