This repository contains the code used to generate a submission that won 3rd place in the Data Visualization Challenge organized by the United Nations Development Programme Human Development Reports.
The submission lives at income-inequality.info
The website files are located in another GitHub repository: Income-Inequality
To launch a live interactive version:
To report errors, ask questions, or offer corrections, please use the Issues section on this or the other repository.
I tried to document my steps and processing. Please let me know if you'd like me to clarify, elaborate, or better-organize this repository.
There were very few steps I had to perform manually (without running a script), primarily cleaning the .csv file headers for the Pandas library to be able to read the files. Another manual step was dealing with the data for China, which was split into two categories: urban and rural. Only four countries had to be removed from the original dataset before the final export (there was trouble with PPP conversion).
Files are numbered for convenience and in rough chronological order:
01 - Generates the 1st, 2nd, and 3rd figures on the website
02 - Generates the 4th - 8th figures, export csv with coordinate data for the interactive graph
03 - Process csv from above step into correct format for interactive graph library
04 - Generate the js dictionaries to convert between ISO2, ISO3, and country names on the website
See the archive folder for the initial exploration with 2005 data
Coding is likely less scary than you think. Getting everything above to work will just take some research and patience.
The above code requires you install:
- Python (version 3.6 or better yet 3.7)
- this will automatically install pip too
- Open your terminal and use pip to install*:
- jupyterlab (a newer version of Jupyter Notebooks)
- pandas
- matplotlib
- seaborn
- Save all the files from this repository
- if you install git you can just use the command
git clone https://github.com/whyboris/Global-Income-Distribution.git
in your terminal
- if you install git you can just use the command
- Finally navigate to this folder in your terminal and type in
jupyter lab
- Your browser should open the Jupyter Lab interface and you'll be able to open and run the files!
* Using pip
to install above is as easy as typing pip install pandas
in your terminal. Though if you start coding more-seriously, consider using virtual environments as well.
When unsure about how to do something, just search online, and on stackoverflow.