housingprediction

Using Machine Learning Methods to Predict Housing Prices in Lodi I'm looking at houses that were recently sold in the Lodi area (95240, 95242, 95209, and 95219 so far) and using different machine learning algorithms to predict housing prices on houses that are currently in the market. I'm a licensed real estate agent and have access to MetroList, so that's where I'm getting my data from. So far, I've tried to predict prices using Random Forests, Decision Tree Regression, Gradient Boosted Regression, XGBoost Regression, Linear Regression, and Neural Networks. Gradient Boosted Regression and XGBoost work the best with the training data I have.

I use pandas to read the data and notice that there are missing values, as well as data that needs to be converted from strings to some meaningful form in either integer or float.

The 15 Features that are being trained and looked at are the following:

-Listing Price
-Days on Market 
-Price Per Square Feet
-Square Footage 
-Lot Size (Square Feet)
-Year Built 
-Year Being Sold
-# of Garage Spaces
-Bedrooms
-Home Association Dues
-Total Bathrooms
-Bathroom-Full
-Bathroom-Half
-Pool
-Zip Code

Some of the values of Lot Size and Year Built are missing or are inputted as 0, so I looked at the median of the training data grouped by the Zip Code the house is located in. Most of the information of what I did to predict housing prices can be found in the Jupyter Notebook (housingpricepredictions.ipynb).

Last Update: 12/15/2016

UPDATE: 1/10/2017 Included a general script (pricepredict.py) with a larger dataset (in the data folder) of about 75,000 training sample points. The general script imputes missing data based on the different zip codes, where now nothing is hard coded, but generalized to the dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.ipynb_checkpoints		.ipynb_checkpoints
data		data
README.md		README.md
XGBoostPredictions.csv		XGBoostPredictions.csv
decisionTreePredictions.csv		decisionTreePredictions.csv
gradientBoostedPredictions.csv		gradientBoostedPredictions.csv
housingpricepredictions.ipynb		housingpricepredictions.ipynb
pricepredict.py		pricepredict.py
simplemodel.py		simplemodel.py
sold3.csv		sold3.csv
testing3.csv		testing3.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

housingprediction

About

Releases

Packages

Languages

dshundal94/housingprediction

Folders and files

Latest commit

History

Repository files navigation

housingprediction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages