Data Mining Project 2022

Business objective

Our objective is to identify popular movies to invest in US-produced movies’ copyrights that will likely have a high ROI, as measured by popularity amongst movie-goers.

General approach

In this project, we tested multiple supervised predictive models and dived into a detailed examination of the top three models: XGBRegressor, GradientBoostingRegressor, and RandomForestRegressor. We expect to measure performance using adjusted R2(given the number of features)and RMSE.Based on our analysis, we believe ourXGBoostmodel with the predictors explains 69% of the variation in log-transformed target variable and as measured by adjustedR2.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data		data
notebooks		notebooks
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Mining Project 2022

Business objective

General approach

The data directory has the small datasets used. The ipynb and html versions of the code are in 'notebooks'.

About

Releases

Packages

Languages

vrittigandhi/data_mining_project_22

Folders and files

Latest commit

History

Repository files navigation

Data Mining Project 2022

Business objective

General approach

The data directory has the small datasets used. The ipynb and html versions of the code are in 'notebooks'.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages