A QA tool for the OpenStreetMap database to detect the non-Unicode Zawgyi text encoding for Burmese (Myanmar) language.
View the daily-generated data in the Viewer at bdon.github.io/OpenStreetMap-BurmeseEncoding
Without a consistent text encoding, it is impossible for application developers to display text correctly across the world. Note that software like Mapnik and Canvas-based renderers can draw Burmese Unicode text correctly; MapLibre GL JS currently cannot.
Because Zawgyi occupies the same codepoints in Unicode, it must be classified using probability.
This tool uses:
- Google's myanmar-tools classifier
- paulmach/osm for parsing OSM
- Rabbit for suggesting Zawgyi->Unicode conversions
Do not automate edits to OpenStreetMap data.
- Create an account on openstreetmap.org
- Click the link to an OSM Object on the Viewer
- Edit text using the web editor
- Add
#zawgyi
to your changeset description.
BSD-2-Clause, except for Rabbit which is pending license.