House Price Predictor

Overview

This repository predicts house prices with machine learning. It covers data preparation, model training, performance evaluation, a Flask application for serving predictions, and deployment on AWS.

Introduction

The House Price Predictor project uses machine learning to forecast house prices from input features. It demonstrates data preprocessing, model training, and evaluation in Python, then serves the trained model through a Flask application deployed on AWS.

Features

Predicts house prices from the Number of Private Rooms feature.
Uses the Python libraries pandas, scikit-learn, matplotlib, and seaborn.
Includes data preprocessing, model training, evaluation, building a Flask application, and deployment on AWS.

Installation

To set up this project, follow these steps:

Clone the repository:

git clone https://github.com/AnoopGeorge418/House-Price-Predictor.git
cd House-Price-Predictor

Create a conda environment and install the required dependencies:

conda create -p House-env python=3.11 -y
conda activate House-env/
pip install -r requirements.txt

Usage

To train, test, and deploy the model, follow these instructions:

Prepare your data:
- Ensure your dataset matches the expected format. Place your data file in the data directory.

Run the training script:

python train_model.py
- This script will preprocess the data, train the model, and save it.

Run the Flask application:

python app.py
- The application will be available at http://127.0.0.1:5000/.
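
For orientation, a Flask app that serves a regression model might look roughly like the sketch below. The `/predict` route, the JSON payload shape, and the stand-in model fitted on made-up numbers are assumptions for illustration only; the project's actual application may be structured differently.

```python
from flask import Flask, jsonify, request
from sklearn.linear_model import LinearRegression

# Stand-in model fitted on made-up data (price = 50 * rooms + 50).
# A real app would load the saved .pkl file instead.
model = LinearRegression().fit([[1], [2], [3], [4]], [100.0, 150.0, 200.0, 250.0])

app = Flask(__name__)


@app.route("/predict", methods=["POST"])  # hypothetical route name
def predict():
    # Expect a JSON body such as {"Number_of_Private_Rooms": 5}.
    payload = request.get_json(silent=True) or {}
    rooms = payload.get("Number_of_Private_Rooms")
    if isinstance(rooms, bool) or not isinstance(rooms, (int, float)):
        return jsonify(error="Number_of_Private_Rooms must be a number"), 400
    price = float(model.predict([[rooms]])[0])
    return jsonify(predicted_price=price)


# To serve locally on the default address, uncomment:
# app.run(host="127.0.0.1", port=5000)
```

With the server running, a POST request to the hypothetical `/predict` route with a JSON body returns the predicted price as JSON; a missing or non-numeric value returns a 400 error.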

Deploy the Flask application to AWS:
- Follow the instructions in the aws-deployment.md file to deploy the Flask application with AWS Elastic Beanstalk and AWS CodePipeline.

Data

The dataset should include features relevant to house pricing. Example column: Number_of_Private_Rooms.

Before training, preprocess the data to handle missing values, encode categorical variables, and scale numerical features.
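
The three preprocessing steps named above could be sketched with scikit-learn as follows. This is a minimal illustration, not the project's own code: the `Neighborhood` column and all sample values are made up, and only `Number_of_Private_Rooms` comes from this README.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Toy data with gaps; "Neighborhood" is a hypothetical categorical column.
df = pd.DataFrame({
    "Number_of_Private_Rooms": [3.0, 4.0, np.nan, 5.0],
    "Neighborhood": ["north", "south", "north", np.nan],
})

numeric_cols = ["Number_of_Private_Rooms"]
categorical_cols = ["Neighborhood"]

# Numeric columns: fill missing values with the median, then standardize.
numeric_steps = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
])

# Categorical columns: fill missing values with the mode, then one-hot encode.
categorical_steps = Pipeline([
    ("impute", SimpleImputer(strategy="most_frequent")),
    ("encode", OneHotEncoder(handle_unknown="ignore")),
])

# sparse_threshold=0 forces a dense NumPy array as output.
preprocessor = ColumnTransformer(
    [
        ("num", numeric_steps, numeric_cols),
        ("cat", categorical_steps, categorical_cols),
    ],
    sparse_threshold=0,
)

X = preprocessor.fit_transform(df)
print(X.shape)
```

Wrapping the steps in a single `ColumnTransformer` means the same imputation, encoding, and scaling learned on the training data can be reapplied unchanged at prediction time.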

Model

The model used is a Linear Regression model implemented with scikit-learn.

Training

The model is trained using train_model.py:

Loads and preprocesses data
Splits data into training and test sets
Trains the Linear Regression model
Evaluates performance
Saves the trained model
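
The steps above can be sketched as follows. This is an assumption-laden outline rather than the contents of train_model.py: the data is synthetic, the `Price` target column is hypothetical, and the model is written to a temporary directory so the example stays self-contained.

```python
import pickle
import tempfile
from pathlib import Path

import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the real dataset; "Price" is a hypothetical target.
rng = np.random.default_rng(0)
rooms = rng.integers(1, 8, size=200).astype(float)
price = 50_000 * rooms + 20_000 + rng.normal(0, 5_000, size=200)
df = pd.DataFrame({"Number_of_Private_Rooms": rooms, "Price": price})

# Split into training and test sets.
X = df[["Number_of_Private_Rooms"]]
y = df["Price"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Train the linear regression model.
model = LinearRegression()
model.fit(X_train, y_train)

# Evaluate on the held-out test set.
y_pred = model.predict(X_test)
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
print(f"MSE: {mse:.2f}  R-squared: {r2:.3f}")

# Save the trained model as a .pkl file (temporary directory used here).
model_path = Path(tempfile.mkdtemp()) / "model.pkl"
with model_path.open("wb") as f:
    pickle.dump(model, f)
```

Fixing `random_state` makes the train/test split reproducible, so the reported metrics do not change between runs on the same data.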

Evaluation

Performance metrics such as Mean Squared Error (MSE) and R-squared are used to evaluate the model.
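
For reference, both metrics can be computed directly from their definitions. The sketch below uses plain Python and made-up values; scikit-learn's `mean_squared_error` and `r2_score` implement the same formulas.

```python
def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """One minus the ratio of the residual to the total sum of squares."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Made-up actual and predicted prices, for illustration only.
actual = [200.0, 250.0, 300.0, 350.0]
predicted = [210.0, 240.0, 310.0, 340.0]

mse = mean_squared_error(actual, predicted)
r2 = r_squared(actual, predicted)
print(f"MSE: {mse:.1f}  R-squared: {r2:.3f}")
```

A lower MSE means smaller prediction errors in squared price units, while an R-squared closer to 1 means the model explains more of the variance in the actual prices.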

Results

After running train_model.py, the model's performance metrics are displayed, and the trained model is saved in the models directory with a .pkl extension.

Contributing

Contributions are encouraged. To contribute:

Fork the repository.
Create a new branch for your changes.
Commit your changes.
Push to your fork.
Create a pull request to the original repository.

License

This project is licensed under the MIT License. See the LICENSE file for more details.
