Cleaning-Toronto-OpenStreetMap-Data

This code cleans OpenStreetMap data for Toronto. The purpose is to look for inconsistencies, uniformity and accuracy in the data. This is the final project for Udacity's data wrangling course in the data analyst nanodegree program

The project uses Python to clean, store and analyze the data. First the data is extracted from OpenStreetMap for a particular area, in this case Toronto. The map data is downloaded in a XML file. Using Python, we iterate through the data and first get an idea of what the data is like e.g. get a count of the number of node and way tags in the XML file. Then information from the tags are extracted, things like id, key and value. These values are stored in a database and then into a csv file. Then we search for inconsistencies in the data, finding patterns and providing insights into the data. Read the report1.pdf and code_for_analysis.py files to see the full process of cleaning this dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
code_for_analysis.py		code_for_analysis.py
map_link.txt		map_link.txt
mydv.db		mydv.db
nodes.csv		nodes.csv
nodes_tags.csv		nodes_tags.csv
references.txt		references.txt
report1.pdf		report1.pdf
toronto_sample.osm		toronto_sample.osm
ways.csv		ways.csv
ways_nodes.csv		ways_nodes.csv
ways_tags.csv		ways_tags.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cleaning-Toronto-OpenStreetMap-Data

About

Releases

Packages

Languages

kalvii045/Cleaning-Toronto-OpenStreetMap-Data

Folders and files

Latest commit

History

Repository files navigation

Cleaning-Toronto-OpenStreetMap-Data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages