Formula One ML & Analytics

Jupyter notebooks with the automatized process to obtain the necessary data from F1 API, resulting in Looker interactive dashboards.

Course: Web Analytics
Final grade: 9.2/10

Tools and covered topics

Jupyter Notebook
Google Looker Studio
Python libraries: numpy, pandas, sqlearn, matplotlib, seaborn
Manipulation of temporal series and prediction: AutoRegressive Integrated Moving Average (ARIMA)
ML models to compare their feature importance: MLPs, Gradient Boosting, DTs, Random Forest, Linear Regression

Objective

The primary goal is to work with Formula One API Ergast (see Documentation here), as well as to perform Data Analytics and a comparison of Machine Learning techniques from the dataset extracted.

Key Features

Prediction of the champion in the next season
Driver Performance Analysis: Explore how drivers perform across different races and seasons.
Race Strategy Optimization: Analyze race data to optimize pit stop strategies and track performance.
Team Comparison: Compare the performance of different F1 teams over time.
Data Visualization: Visualize complex F1 data in an intuitive Looker dashboard.

Expected Results

The final result should represent accurately the statistics and outputs of the ML models for the Formula One drivers, cars and teams in a Looker Dashboard.

The ML models to compare its feature importances are:

Random Forest
Linear Regression
MLP
Gradient Boosting
Decision Trees

Some examples of analytics metrics performed are:

Clustering of drives thourghout F1 history
Lap times in each circuit along history
Boxes and result per circuit and year
Evolution of the champions' points per year

Evaluation Criteria

The project will be evaluated based on the following criteria:

Code Execution: Your code must run without errors.
Scalability: Your implementation should be scalable, tested with datasets ten times larger on a cluster with 80 execution workers.
Documentation: Adequate code documentation is required.
Complexity of the Machine Learning techniques: Advanced techniques must be performed.

Contributing

Contributions are welcome! Please fork the repository and submit a pull request with your changes. Feel free to customize it further to match your repository's specific details and needs!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Formula One ML & Analytics

Contents

Tools and covered topics

Objective

Key Features

Expected Results

Evaluation Criteria

Contributing

Files

README.md

Latest commit

History

README.md

File metadata and controls

Formula One ML & Analytics

Contents

Tools and covered topics

Objective

Key Features

Expected Results

Evaluation Criteria

Contributing