This data science project aims to build a real estate price prediction website. During model building we will cover concepts such as data loading, data cleaning, outlier detection and removal, feature engineering, dimensionality reduction, gridsearchcv for hyperparameter tunning, k fold cross validation, etc.
- Python
- Numpy and Pandas - Data Cleaning
- Matplotlib - Data Visualization
- Sklearn - Model Building
- Jupyter Notebook, Visual Studio Code and Pycharm - IDE
- Python Flask - HTTP Server
- HTML/CSS/Javascript - UI
- AWS
- We will first build a model using sklearn and linear regression using banglore home prices dataset from kaggle.com.
- We would write a python flask server that uses the saved model to serve http requests.
- We build the website using html, css and javascript that allows user to enter home square ft area, bedrooms etc and it will call python flask server to retrieve the predicted price.
- Create EC2 instance using amazon console, also in security group add a rule to allow HTTP incoming traffic
- Now connect to your instance using a command like this,
ssh -i "C:\Users\Viral\.ssh\Banglore.pem" ubuntu@ec2-3-133-88-210.us-east-2.compute.amazonaws.com
- nginx setup
- Install nginx on EC2 instance using these commands,
sudo apt-get update sudo apt-get install nginx
- Above will install nginx as well as run it. Check status of nginx using
sudo service nginx status
- Here are the commands to start/stop/restart nginx
sudo service nginx start sudo service nginx stop sudo service nginx restart
- Now when you load cloud url in browser you will see a message saying "welcome to nginx" This means your nginx is setup and running.
- Now you need to copy all your code to EC2 instance. You can do this either using git or copy files using winscp. We will use winscp. You can download winscp from here: https://winscp.net/eng/download.php
- Once you connect to EC2 instance from winscp, you can now copy all code files into /home/ubuntu/ folder. The full path of your root folder is now: /home/ubuntu/BangloreHomePrices
- After copying code on EC2 server now we can point nginx to load our property website by default. For below steps,
- Create this file /etc/nginx/sites-available/bhp.conf. The file content looks like this,
server { listen 80; server_name bhp; root /home/ubuntu/BangloreHomePrices/client; index app.html; location /api/ { rewrite ^/api(.*) $1 break; proxy_pass http://127.0.0.1:5000; } }
- Create symlink for this file in /etc/nginx/sites-enabled by running this command,
sudo ln -v -s /etc/nginx/sites-available/bhp.conf
- Remove symlink for default file in /etc/nginx/sites-enabled directory,
sudo unlink default
- Restart nginx,
sudo service nginx restart
- Now install python packages and start flask server
sudo apt-get install python3-pip
sudo pip3 install -r /home/ubuntu/BangloreHomePrices/server/requirements.txt
python3 /home/ubuntu/BangloreHomePrices/client/server.py
Running last command above will prompt that server is running on port 5000. 8. Now just load your cloud url in browser and this will be fully functional website running in production cloud environment