
JasonWoof/steno-dictionary-generator


Algorithm-Generated Steno Dictionary

This project is designed to answer the question: "Can we make a decent stenography dictionary with an algorithm and free data?"

Status

Very early development.

Currently it only parses a Wiktionary dump and gets pronunciations from espeak. Nothing steno-related is implemented yet.
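To illustrate the espeak step, here is a minimal sketch of asking espeak for the pronunciation of a single word from Node.js. It is not this project's code: the helper names `espeakArgs`, `cleanPhonemes` and `pronounce` are hypothetical, and it assumes an `espeak` binary on the PATH that accepts `-q` (no audio), `--ipa` (print IPA phonemes) and `-v` (voice).

```javascript
const { execFileSync } = require("node:child_process");

// Build the espeak argument list for one word (hypothetical helper).
// -q: do not play audio; --ipa: print phonemes as IPA; -v: voice to use.
// Words beginning with "-" would be read as options; this sketch ignores that case.
function espeakArgs(word, voice = "en-us") {
  return ["-q", "--ipa", "-v", voice, word];
}

// Trim the output and collapse any runs of whitespace into single spaces.
function cleanPhonemes(raw) {
  return raw.trim().replace(/\s+/g, " ");
}

// Run espeak once for a single word and return the cleaned phoneme string.
// Spawning one process per word is slow for a full dump; this is only a sketch.
function pronounce(word, voice = "en-us") {
  const raw = execFileSync("espeak", espeakArgs(word, voice), {
    encoding: "utf8",
  });
  return cleanPhonemes(raw);
}
```

Calling `pronounce("cat")` should return an IPA string for the word; the exact symbols depend on the installed espeak version and voice.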

How To Run It

Download a Wiktionary data dump: https://dumps.wikimedia.org/mirrors.html -> dumps/ -> enwiktionary/ -> DATE -> enwiktionary-DATE-pages-articles.xml.bz2

In the commands below, enwiktionary.xml.bz2 stands for the path to the file you downloaded.

Option 1: install Docker, then run: ./run-in-docker.sh -i enwiktionary.xml.bz2 -o output.ndjson

Option 2: install Node.js and espeak, then run: ./run.sh -i enwiktionary.xml.bz2 -o output.ndjson
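The output file name suggests NDJSON (newline-delimited JSON, one JSON value per line). A minimal sketch for reading such a file in Node.js follows. The helper names `parseNdjson` and `loadNdjson` are hypothetical, and nothing here assumes particular field names, since the README does not document the record format.

```javascript
const fs = require("node:fs");

// Parse NDJSON text: one JSON value per non-empty line.
function parseNdjson(text) {
  return text
    .split("\n")
    .filter((line) => line.trim() !== "")
    .map((line) => JSON.parse(line));
}

// Read a whole NDJSON file into memory and parse it.
// For a very large output file, reading line by line would use less memory.
function loadNdjson(path) {
  return parseNdjson(fs.readFileSync(path, "utf8"));
}
```

For example, `loadNdjson("output.ndjson").length` gives the number of records written.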

About

Generate a steno dictionary from free data and code
