This repository consists of an anaylsis undertaken to better understand the experience around Remote Work for developers. To read more the findings please checkout the associated medium article.
The data used is the Annual Developer Survey collected by Stackoverflow for the year 2019 (latest available). It contains responses of nearly 90,000 participants from over 170 countries and dependent territories. According to Stack Overflow, 2019 Annual Developer Survey examines all aspects of the developer experience from career satisfaction and job search to education and opinions on open source software.
To get started please download the data (~200MB) from here and add it under 'data/' directory.
The project was developed using python 3.6.7 with the following packages.
- Pandas
- Numpy
- Matplotlib
- Seaborn
Installation with pip:
pip install -r requirements.txt
- AnalysisReport.ipynb : Jupyter Notebook with all the workings including data preparation, analysis and findings.
- requirements.txt : pre-requiste libraries for the project
- ouput/ : images from Data Visualization
The questions answered regarding remote work through data in the analysis are
- Are developers who work remotely more satisfied with the job?
- Do remote workers earn more?
- In what type of organisations are remote work popular?
- What are the most common challenges faced when working remotely?
This project employed CRISP-DM, a popular methodology for solving data science problems. Business Questions were raised and and an understanding of the data was achieved in the beginning. The data was prepared for the relevant questions, and throughly analysed to obtain findings. The findings were validated and the results were communicated in the form of an blog post.
Stack Overflow, for collecting, cleaning and providing the data for analysis.