Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crosslink detection: consider taking simple sound changes into account #11

Open
stscoundrel opened this issue Jan 22, 2023 · 1 comment
Assignees

Comments

@stscoundrel
Copy link
Owner

Simple example: from Old Norse word "hvalr", there would later be descendant "hval", simply by dropping the ending r. This would be common pattern that may be easy to recognize, allowing links from west norse to east norse words (=west generally kept -r longer)

See if this kind of feature:

  • Produces respectable amount of crosslinks without too many false positives. Should there be too few, we may be better off just using manual overrides
  • Has decent enough performance. Naive way could easily end up doing quite a bit of comparisons, as there are some 120K+ entries in the dictionaries.
@stscoundrel stscoundrel self-assigned this Jan 22, 2023
@stscoundrel
Copy link
Owner Author

stscoundrel commented Jan 29, 2023

Another common one seems to be writing eth of western dialects as "dh" in Old Swedish. For example, various words containing faðir reliably turning into fadhir

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant