Neural_Machine_Translation

Best model metrics

Metric	Score
Corpus BLEU	37.0347
Dev ppl	61.4084

You can find the model weights here

Try Translating yourself!

The translation demo is available here on Streamlit Sharing.
Even is you don't know Spanish you can use the demo as there is Google Translate which will help you to convert your English sentences to Spanish.

About NMT model

Hybrid Word-Character Seq2Seq Machine Translation

It is a Seq2Seq Model that translates Spanish sentences into English based on Luong et al. 2015.
It consists of a bidirectional LSTM encoder and unidirectional LSTM decoder.
It also uses attention mechanism to boost its performance on the translation task.
The pipeline and the implementations is inspired by the Open-NMT package.

The model becomes more powerful as we combine character-level with word-level language modelling.
The idea is that whenever the NMT model generates a <unk> token we run a character-level language model and generate a word in the output character by character.
This hybrid word-character approach was proposed by Luong and Manning 2016 and turned out to be effective in increasing the performance of the NMT model (+1.2 BLEU).

Installation

Install from source:

git clone https://github.com/sahilkhose/Neural_Machine_Translation
cd Neural_Machine_Translation
pip3 install -r requirements.txt

To run the translation demo:

streamlit run stream_translate.py

Or just go here on Streamlit Sharing.

Contributing

If you find a bug, create a GitHub issue, or even better, submit a pull request. Similarly, if you have questions, simply post them as GitHub issues.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
figures		figures
outputs		outputs
project_pdfs		project_pdfs
sanity_check_en_es_data		sanity_check_en_es_data
trans_new		trans_new
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
char_decoder.py		char_decoder.py
cnn.py		cnn.py
collect_submission.sh		collect_submission.sh
gpu_requirements.txt		gpu_requirements.txt
highway.py		highway.py
local_env.yml		local_env.yml
model_embeddings.py		model_embeddings.py
nmt_model.py		nmt_model.py
packages.txt		packages.txt
requirements.txt		requirements.txt
run.py		run.py
run.sh		run.sh
sanity_check.py		sanity_check.py
stream_translate.py		stream_translate.py
translate.py		translate.py
utils.py		utils.py
vocab.json		vocab.json
vocab.py		vocab.py
vocab_tiny_q1.json		vocab_tiny_q1.json
vocab_tiny_q2.json		vocab_tiny_q2.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural_Machine_Translation

Best model metrics

Try Translating yourself!

About NMT model

Hybrid Word-Character Seq2Seq Machine Translation

Installation

Contributing

About

Releases

Packages

Languages

License

sahilkhose/Neural_Machine_Translation

Folders and files

Latest commit

History

Repository files navigation

Neural_Machine_Translation

Best model metrics

Try Translating yourself!

About NMT model

Hybrid Word-Character Seq2Seq Machine Translation

Installation

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages