Algorithm-Generated Steno Dictionary

This project is designed to answer the question "Can we make a decent stenography dictionary with an algorithm and free data?

Status

Status: very early development stages.

Currently it just parses a wiktionary dump gets pronunciations from espeak (no steno anything yet).

Download a Wiktionary data dump: https://dumps.wikimedia.org/mirrors.html -> dumps/ -> enwiktionary/ -> DATE -> enwiktionary-DATE-pages-articles.xml.bz2

Option 1: install docker, then run: ./run-in-docker.sh -i enwiktionary.xml.bz2 -o output.ndjson

Option 2: install node.js and espeak, then run: ./run.sh -i enwiktionary.xml.bz2 -o output.ndjson

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Dockerfile		Dockerfile
README.md		README.md
index.js		index.js
package.json		package.json
run-in-docker.sh		run-in-docker.sh
run.sh		run.sh