Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
-
Updated
Aug 7, 2024 - Python
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Morfessor is a tool for unsupervised and semi-supervised morphological segmentation
Properly handle position-dependent phones in a subword lexicon FST
Morfessor FlatCat
Morfessor EM+Prune
Finalfusion embeddings in Python
A python package to build a corpus vocabulary using the byte pair methodology and also a tokenizer to tokenize input texts based on the built vocab.
Cognate-aware morphological segmentation
A tool for generating sub-word (phone or grapheme) level embeddings from an HTK-style MLF ASR corpus
Morfessor EM+Prune
Morfessor demonstration
Subword Neural Machine Translation
🕵️ Language Model based on RNN for generating Sherlock Holmes stories.
Add a description, image, and links to the subword-units topic page so that developers can more easily learn about it.
To associate your repository with the subword-units topic, visit your repo's landing page and select "manage topics."