This project was my Capstone Project for the Data Engineering Certification Training with The Center of Applied Data Science (CADS). It was a group project of 5 people, led by me, focusing on data cleaning and prediction using machine learning on credit card customer data [1]. The process included:
- Data wrangling with Python and pandas.
- Exploratory Data Analysis and visualization with seaborn and matplotlib.
- Modelling using scikit-learn (RandomForestRegressor).
The entire process was compiled in this Jupyter notebook.
📌 Note from Author: As this is one of my first projects ever, it's admittedly a bit messy. I chose not to alter anything, so that it can serve as a reference for my growth in the future.
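For readers who want a feel for the steps listed above without opening the notebook, here is a minimal sketch of the wrangling and modelling stages. It is not the notebook's actual code: the data is generated on the fly, and every column name (`age`, `monthly_income`, `card_type`, `monthly_spend`), the `clean` helper, and the model settings are hypothetical stand-ins chosen for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

# Synthetic stand-in data; all column names here are hypothetical.
rng = np.random.default_rng(seed=0)
n_rows = 200
raw = pd.DataFrame({
    "age": rng.integers(21, 70, size=n_rows).astype(float),
    "monthly_income": rng.normal(5000, 1500, size=n_rows),
    "card_type": rng.choice(["classic", "gold", "platinum"], size=n_rows),
})
raw["monthly_spend"] = 0.3 * raw["monthly_income"] + rng.normal(0, 200, size=n_rows)

# Make the data messy: blank out some ages and append duplicate rows.
raw.loc[raw.index[:10], "age"] = np.nan
raw = pd.concat([raw, raw.iloc[-5:]], ignore_index=True)


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop duplicate rows, fill missing ages, one-hot encode the card type."""
    out = df.drop_duplicates().copy()
    out["age"] = out["age"].fillna(out["age"].median())
    return pd.get_dummies(out, columns=["card_type"], dtype=float)


clean_df = clean(raw)

# Predict the (hypothetical) spend column from the remaining features.
X = clean_df.drop(columns=["monthly_spend"])
y = clean_df["monthly_spend"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

predictions = model.predict(X_test)
mae = mean_absolute_error(y_test, predictions)
print(f"Mean absolute error on held-out rows: {mae:.2f}")
```

The exploratory step with seaborn and matplotlib would sit between `clean` and the train/test split, for example by plotting feature distributions from `clean_df`.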
Footnotes
1. The data used is mock data, and its use in students' portfolios has been permitted.