- To scrape new listings, install Scrapy:
$ pip install scrapy
- To follow my analysis in the Jupyter Notebook, install the following packages (a short usage sketch follows this list):
- Tabula to extract information from PDFs:
$ pip install tabula-py
- FuzzyWuzzy to match strings:
$ pip install fuzzywuzzy
- Plotly for visualizations in the Jupyter Notebook:
$ pip install plotly
- Chart Studio if you want to export your Plotly visualizations:
$ pip install chart_studio
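Each of these packages is used roughly as follows. This is a minimal sketch with placeholder file names and example data, not code taken from the notebook:

import pandas as pd
import plotly.express as px
import tabula
from fuzzywuzzy import fuzz

# tabula-py extracts tables from a PDF into a list of pandas DataFrames
# ("rent_cap_tables.pdf" is a placeholder file name)
tables = tabula.read_pdf("rent_cap_tables.pdf", pages="all")

# fuzzywuzzy scores string similarity from 0 to 100, handy for matching
# landlord names that are spelled slightly differently across listings
score = fuzz.token_sort_ratio("Deutsche Wohnen SE", "Deutsche Wohnen")
print(score)

# plotly renders interactive charts inline in a Jupyter Notebook
df = pd.DataFrame({"district": ["Mitte", "Pankow"], "cold_rent": [950, 780]})
px.bar(df, x="district", y="cold_rent").show()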
To stop the ever-increasing cost of housing, the Berlin state government passed a controversial law that caps rents. The new law comes into effect on February 23.
As my next data science side project, I decided to analyse current online listings on ImmobilienScout24 to see whether landlords already respect the new rent cap. In particular, I wanted to answer:
- How many listings are priced above the allowed rent cap?
- How much more per month do tenants pay in total than they would have to under the new rent cap?
- How much would the average cold rent decrease under the new law?
- What is the distribution of the excess rent under the new law?
- How would the average cold rent price change per district?
- Which big real estate firms are charging the most excess rent?
To run the spider/web crawler:
- From the base folder berlin_rental_prices, change into the subfolder berlin_rental_prices:
$ cd berlin_rental_prices
- Run the spider in your terminal:
$ scrapy crawl immo_scraper -o your_file_name.csv
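For orientation, here is a minimal sketch of what a Scrapy spider like immo_scraper can look like. The start URL and CSS selectors below are illustrative assumptions, not the project's actual spider:

import scrapy

class ImmoSpider(scrapy.Spider):
    # Name used by "scrapy crawl immo_scraper"
    name = "immo_scraper"
    # Placeholder search URL; the real spider may start elsewhere
    start_urls = ["https://www.immobilienscout24.de/Suche/de/berlin/berlin/wohnung-mieten"]

    def parse(self, response):
        # Yield one item per listing on the results page (selectors are hypothetical)
        for listing in response.css("article.result-list-entry"):
            yield {
                "title": listing.css("h5::text").get(),
                "cold_rent": listing.css("dd::text").get(),
            }
        # Follow the pagination link, if any (selector is hypothetical)
        next_page = response.css("a.next-page::attr(href)").get()
        if next_page:
            yield response.follow(next_page, callback=self.parse)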
To run the Jupyter Notebook:
- Navigate to the following folder: berlin_rental_prices -> berlin_rental_prices -> berlin_rental_prices
- Open the data_analysis.ipynb Jupyter Notebook
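If you only want the gist of the excess-rent calculation, here is a rough sketch in pandas. The column names cold_rent and rent_cap are hypothetical; the notebook derives the applicable cap from the Tabula-extracted tables:

import pandas as pd

# Load the scraped listings (file and column names are illustrative)
df = pd.read_csv("your_file_name.csv")

# Excess rent: how far a listing's cold rent exceeds its legal cap
df["excess_rent"] = (df["cold_rent"] - df["rent_cap"]).clip(lower=0)

over_cap = df[df["excess_rent"] > 0]
print(f"Listings above the cap: {len(over_cap)} of {len(df)}")
print(f"Total monthly excess rent: {df['excess_rent'].sum():.2f} EUR")
print(f"Average cold-rent decrease under the cap: {df['excess_rent'].mean():.2f} EUR")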
The main findings of the analysis can be found in the post available here.
Please credit the author for the data. Otherwise, feel free to use the code here as you like!