- Understanding the Problem Statement
- Data Exploration
- Data Visualization
- Data Cleaning
- Data Preprocessing
- Model Building
- Model Evaluation
- Choosing Best Machine Learning Model
- Save the Best Machine Learning Model
- Make prediction with Best Machine Learning Model
- Creating a Web Interface for User Predictions
This project explores how students' performance (exam scores) is influenced by variables such as Gender, Ethnicity, Parental Level of Education, Lunch Type, and Test Preparation Course.
- The dataset used in this project is obtained from Kaggle.
- The dataset contains 8 columns and 1000 rows.
Attribute | Description | Possible Values |
---|---|---|
gender | Sex of students | Male, Female |
race_ethnicity | Ethnicity of students | Group A, Group B, Group C, Group D, Group E |
parental_level_of_education | Parents' final education | Bachelor's degree, Some college, Master's degree, Associate's degree, High school |
lunch | Type of lunch before test | Standard, Free/reduced |
test_preparation_course | Completion of test preparation course | Complete, Not complete |
math_score | Score in math | Numeric value |
reading_score | Score in reading | Numeric value |
writing_score | Score in writing | Numeric value |
- setuptools
- numpy
- pandas
- matplotlib
- seaborn
- openpyxl
- scikit-learn
- xgboost
- catboost
The project code includes data ingestion, preprocessing, model building, evaluation, and deployment steps. A web interface is provided to allow users to input their data and receive predictions.
smp.mp4
The web interface built using Flask allows users to input various parameters to predict a student's math score. The interface is designed to be user-friendly and interactive.
Feel free to explore the project and use the interface to see how different variables can affect student performance in exams.