Skip to content

Latest commit

 

History

History

features_intro

Understanding How Project Geospatial Features are Derived from OpenStreetMap

Many projects in AidData's Global Chinese Development Finance Dataset which have a physical footprint - such as roads, airports, power plants, or hospitals - have been mapped using OpenStreetMap (OSM). OpenStreetMap is a free, editable map of the whole world that is being built by volunteers largely from scratch and released with an open-content license (wiki). For projects with a physical footprint, AidData either searches the OSM database for existing features (e.g., Wattay International Airport in Laos) or generates driving directions associated with road projects (e.g., the N’Zeto-Soyo Highway Project in Angola). Once one or more features are found (or directions generated) which are associated with a project, the web links to the associated pages are saved so that the geospatial features can be extracted.

Wattay International Airport, Laos

N’Zeto-Soyo Highway Project, Angola

Before features can be extracted from OSM pages, we must understand the different types of data available through OSM. OSM features consist of 3 elements: nodes, ways, and relations. Driving directions will be treated as a separate fourth type that we address shortly. Nodes define points, ways define linear features (i.e., a line) or area boundaries (i.e., a polygon), while relations are a more complex description of how other elements can be combined together. Relations can include various combinations of nodes, ways, and other relations as members. Given the nature of relations, and since an individual project may consist of multiple OSM features, it is common that multiple geospatial feature types (points, lines, polygons, and multi-feature versions of each) are associated with a single project. This presents a challenge when attempting to produce a uniform data model for our dataset.

To overcome this issue, all point and line features are buffered slightly (approximately one meter) to transform them into polygons. To account for multiple features - all of which are now polygons - associated with a single project, a single multi-polygon feature is created which combines the polygons. In order to avoid producing invalid geometries, such as ones that have self-intersections, all polygons associated with a project are dissolved prior to generating the final MultiPolygon. ESRI's ArcGIS Resource Center provides additional details on these types of geospatial features.

Separate Point, LineString, and Polygon features

Combined MultiPolygon

The final collection of MultiPolygons is then exported using the GeoJSON and GeoPackage file formats for geospatial data.