You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Originally posted by Ksaspar March 12, 2024
While it is great that monkeytype offers Estonian language, the word set is sub-bar. It seems to be created from media texts and contains a lot of person names, some capital letters and abbreviations.
I found a better word corpus here: https://www.cl.ut.ee/ressursid/sagedused1/
It contains 5 million words from journalism, fiction and scientific text each. I can convert the word lists into appropriate format if someone is willing to add them.
Here are the word lists:
Estonian 200 - pastebin.com/eZ18aSBB
Estonian 1k - pastebin.com/NsbjM7ME
Estonian 5k - pastebin.com/dmfaAsR0
Estonian 10k - pastebin.com/DUFyKfN5
Use the linked lists to update estonian files.
The text was updated successfully, but these errors were encountered:
Discussed in #5217
Originally posted by Ksaspar March 12, 2024
While it is great that monkeytype offers Estonian language, the word set is sub-bar. It seems to be created from media texts and contains a lot of person names, some capital letters and abbreviations.
I found a better word corpus here: https://www.cl.ut.ee/ressursid/sagedused1/
It contains 5 million words from journalism, fiction and scientific text each. I can convert the word lists into appropriate format if someone is willing to add them.
Here are the word lists:
Estonian 200 - pastebin.com/eZ18aSBB
Estonian 1k - pastebin.com/NsbjM7ME
Estonian 5k - pastebin.com/dmfaAsR0
Estonian 10k - pastebin.com/DUFyKfN5
Use the linked lists to update estonian files.
The text was updated successfully, but these errors were encountered: