Welcome to the Car Price Prediction repository! This project focuses on predicting car prices using multiple linear regression techniques. The goal is to model the price of cars based on various features to understand the factors influencing car pricing.
In this project, we use a dataset containing information about different cars to build and evaluate a multiple linear regression model. The model helps in predicting car prices by analyzing the relationships between the price and other car attributes.
- Jupyter Notebook: A well-commented notebook that details the entire process of building and evaluating the multiple linear regression model.
- CSV Files: Contains the dataset used for training and evaluating the model.
- PDF Document: A PDF with important concepts related to multiple linear regression to help you understand the theoretical background.
The dataset used in this project has the following columns:
- Car_ID: Unique id of each observation (Integer) 🚀
- Symboling: Assigned insurance risk rating. A value of +3 indicates that the auto is risky, -3 that it is probably pretty safe. (Categorical) 🛡️
- carCompany: Name of the car company (Categorical) 🏢
- fueltype: Car fuel type (gas or diesel) (Categorical) ⛽
- aspiration: Aspiration used in a car (Categorical) 🌬️
- doornumber: Number of doors in a car (Categorical) 🚪
- carbody: Body type of the car (Categorical) 🚗
- drivewheel: Type of drive wheel (Categorical) 🚙
- enginelocation: Location of the car engine (Categorical) 🏎️
- wheelbase: Wheelbase of the car (Numeric) 📏
- carlength: Length of the car (Numeric) 📐
- carwidth: Width of the car (Numeric) 📏
- carheight: Height of the car (Numeric) 📏
- curbweight: The weight of a car without occupants or baggage (Numeric) ⚖️
- enginetype: Type of engine (Categorical) 🔧
- cylindernumber: Number of cylinders in the car (Categorical) 🔩
- enginesize: Size of the engine (Numeric) 🔧
- fuelsystem: Fuel system of the car (Categorical) ⛽
- boreratio: Bore ratio of the car (Numeric) 📏
- stroke: Stroke or volume inside the engine (Numeric) 📏
- compressionratio: Compression ratio of the car (Numeric) 🧮
- horsepower: Horsepower (Numeric) 🏋️
- peakrpm: Peak RPM of the car (Numeric) ⏱️
- citympg: Mileage in the city (Numeric) 🚦
- highwaympg: Mileage on the highway (Numeric) 🚗
- price: Price of the car (Dependent Variable) 💵
A Chinese automobile company, Geely Auto, is planning to enter the US market. They want to understand the factors that affect car pricing in the American market compared to the Chinese market.
- Identify Significant Variables: Determine which variables are most significant in predicting car prices.
- Understand Pricing Dynamics: Use the model to understand how car prices vary with different independent variables.
- Strategic Planning: Utilize the model to make informed decisions regarding car design, pricing, and business strategies in the new market.
- Python 3.x
- Jupyter Notebook
- pandas
- numpy
- scikit-learn
- matplotlib
- seaborn
For detailed explanations and comments on the notebook, refer to the well-documented Jupyter Notebook included in the repository.
For any questions or feedback, feel free to reach out to me at akashanandani.56@gmail.com