Skip to content

Latest commit

 

History

History
13 lines (11 loc) · 625 Bytes

README.md

File metadata and controls

13 lines (11 loc) · 625 Bytes

WORK IN PROGRESS Scripts to generate abbreviations from wordlists for use in transcription.

Rules for generating abbreviations

  1. Remove vowels unless they occur at the start of the word
  2. Abbreviation cannot collide with common 2 or 3 letter words

Sources: 30k.txt from derekchuank/high-frequency-words 20k.txt from first20hours/google-10000-english 8k.txt from http://www.use-in-a-sentence.com/english-words/10000-words/the-most-frequent-10000-words-of-english.html 2letter.txt from https://www.logicofenglish.com/spelling-lists/by-phonogram/331-two-letter-words 3letter.txt from https://www.totallystupid.com/?what=3