My reimplementation of LMU's MED system. Note that this reimplementation is already outdated. I've included the med.yml for reference if anyone's curious, but the underlying library from IBM is no longer supported.
It requires pytorch and pytorch-seq2seq (the bundled version is my mocked-up copy, a temporary fix), so make sure you have those installed.
We also now use git flow: develop will change rapidly and break frequently, while master and release should be more stable. Fair warning, though: this is ALL STILL VERY ALPHA.
Big todos:
- Add word vectors
- Fully integrate pytorch-seq2seq into the source code (lots of edits were needed to make things work)
- Fix the model-loading bug: it currently doesn't consistently load what's in the config file
- The new develop version is a fairly drastic update: update `main.py` to be compatible with the new development branch, as exemplified in IBM's new `sample.py`
- Set up evaluation on the test set
- Make it possible to save models with custom names
- Set up ensembling, possibly as a separate script
- Add Faruqui attention
- Make sure the config file and command-line options are in sync
- Allow saving of model outputs (predictions)
- Clean out the `results` folder upon final release
- MAJOR DEVIATION FROM MED: encode features separately (see the sketch after this list)
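Since encoding features separately is the main planned departure from MED, here is a minimal sketch of the idea, not the actual implementation: the morphological tags get their own embedding and are combined with the character encoder's states, instead of being interleaved with the character sequence. All module names and dimensions below are illustrative.

```python
import torch
import torch.nn as nn

class SeparateFeatureEncoder(nn.Module):
    """Sketch: characters go through a BiGRU, features are embedded and pooled separately."""

    def __init__(self, char_vocab_size, feat_vocab_size, emb_dim=64, hidden_dim=128):
        super().__init__()
        self.char_emb = nn.Embedding(char_vocab_size, emb_dim)
        self.char_rnn = nn.GRU(emb_dim, hidden_dim, batch_first=True, bidirectional=True)
        # Morphological features get their own embedding table, independent of the characters.
        self.feat_emb = nn.Embedding(feat_vocab_size, emb_dim)
        self.feat_proj = nn.Linear(emb_dim, 2 * hidden_dim)

    def forward(self, char_ids, feat_ids):
        # char_ids: (batch, src_len), feat_ids: (batch, num_feats)
        char_states, _ = self.char_rnn(self.char_emb(char_ids))          # (batch, src_len, 2*hidden)
        feat_vec = self.feat_proj(self.feat_emb(feat_ids).mean(dim=1))   # (batch, 2*hidden)
        # One simple way to combine them: add the pooled feature vector to every
        # character state; the decoder then attends over the result.
        return char_states + feat_vec.unsqueeze(1)
```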
`preprocess.py SIGMORPHON_FILE` generates `data.txt`, `vocab.source`, and `vocab.target`.

`make_splits.py data.txt` generates train, dev, and test splits.

`data.txt` is a tab-separated file with the source on the left and the target on the right. The `vocab.*` files have exactly one vocab element per line.
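For clarity, here is a minimal sketch of reading these files, assuming exactly the layout described above (tab-separated source/target pairs in `data.txt`, one vocabulary element per line in `vocab.*`); the actual loading code in the repo may differ.

```python
def load_pairs(path="data.txt"):
    # Each line: source<TAB>target
    pairs = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            source, target = line.rstrip("\n").split("\t")
            pairs.append((source, target))
    return pairs

def load_vocab(path):
    # One vocabulary element per line.
    with open(path, encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f if line.strip()]

pairs = load_pairs("data.txt")
source_vocab = load_vocab("vocab.source")
target_vocab = load_vocab("vocab.target")
```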
`./main.py --config config.yml` runs the model in train or eval mode.
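As a rough illustration of the config-driven entry point, `main.py` could be wired along these lines; the key names (`mode`, `train_path`, `checkpoint`) are hypothetical and not taken from the real `config.yml` or `med.yml`.

```python
import argparse
import yaml

def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--config", default="config.yml",
                        help="Path to the YAML config file")
    args = parser.parse_args()

    with open(args.config) as f:
        config = yaml.safe_load(f)

    # 'mode', 'train_path', and 'checkpoint' are illustrative names only.
    if config.get("mode", "train") == "train":
        print("Training on", config.get("train_path"))
        # ... build the seq2seq model and run training here ...
    else:
        print("Evaluating checkpoint", config.get("checkpoint"))
        # ... load the saved model and run evaluation here ...

if __name__ == "__main__":
    main()
```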
Current validation results:

- SIGMORPHON 2016 German dev set: 1524 out of 1597 correct (accuracy 0.9543)
- SIGMORPHON set sans adjectives: 609 out of 673 correct (accuracy 0.9049)
- Russian, with word vectors: 1453 out of 1591 correct (accuracy 0.9133)