
Attention Is All You Need (Transformer)

A PyTorch implementation of the paper 'Attention Is All You Need' (Vaswani et al., NeurIPS 2017).
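
The core operation of the paper is scaled dot-product attention, Attention(Q, K, V) = softmax(QKᵀ / √d_k) V. Below is a minimal PyTorch sketch of that formula for orientation; it is illustrative only and not necessarily the module layout used in this repo.

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, d_k)
    d_k = q.size(-1)
    # softmax(Q K^T / sqrt(d_k)) V
    scores = torch.matmul(q, k.transpose(-2, -1)) / math.sqrt(d_k)
    if mask is not None:
        # zero entries in the mask are padded or future positions
        scores = scores.masked_fill(mask == 0, float('-inf'))
    weights = torch.softmax(scores, dim=-1)
    return torch.matmul(weights, v), weights
```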

Requirements

  • Python >= 3.6
  • PyTorch >= 1.4
  • Dataset: Multi30k (see the loading sketch below)
  • Features: framework-agnostic model implementation
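
For reference, Multi30k can be loaded through the legacy torchtext API that matches the PyTorch 1.4 era (torchtext <= 0.8 is assumed here); this repo's actual data pipeline may differ.

```python
# Hedged sketch: load Multi30k (German -> English) with legacy torchtext.
from torchtext.data import Field, BucketIterator
from torchtext.datasets import Multi30k

SRC = Field(init_token='<sos>', eos_token='<eos>', lower=True)  # German source
TRG = Field(init_token='<sos>', eos_token='<eos>', lower=True)  # English target

train_data, valid_data, test_data = Multi30k.splits(
    exts=('.de', '.en'), fields=(SRC, TRG))

# Build vocabularies from the training split only
SRC.build_vocab(train_data, min_freq=2)
TRG.build_vocab(train_data, min_freq=2)

train_iter, valid_iter, test_iter = BucketIterator.splits(
    (train_data, valid_data, test_data), batch_size=128)
```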

Results

  • Prediction results: results/pred.txt (a scoring sketch follows this list)
  • 12 epochs (train loss 1.3013): BLEU = 30.59, 63.6/38.1/25.5/17.0 (BP=0.956, ratio=0.957, hyp_len=11710, ref_len=12242)
  • 21 epochs (train loss 0.7675): BLEU = 32.81, 63.5/39.1/26.6/17.9 (BP=0.994, ratio=0.994, hyp_len=12174, ref_len=12242)
  • 39 epochs (train loss 0.2197): BLEU = 31.77, 62.9/37.6/25.3/17.1 (BP=1.000, ratio=1.007, hyp_len=12327, ref_len=12242)
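
The BLEU lines above carry sacreBLEU-style fields (corpus BLEU, 1–4-gram precisions, brevity penalty, length ratio, hypothesis/reference lengths). A sketch of how such a score could be recomputed from the prediction file; results/ref.txt is an assumed reference path, not a file this repo is known to ship.

```python
# Hedged sketch: rescore predictions with sacreBLEU.
import sacrebleu

with open('results/pred.txt') as f:
    hyps = [line.strip() for line in f]
with open('results/ref.txt') as f:  # assumed reference file name
    refs = [line.strip() for line in f]

bleu = sacrebleu.corpus_bleu(hyps, [refs])
print(bleu)  # e.g. "BLEU = 30.59 63.6/38.1/25.5/17.0 (BP = 0.956 ...)"
```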

References

[1] Transformer in DGL (https://github.com/dmlc/dgl/tree/master/examples/pytorch/transformer)

[2] Multi30k (https://www.aclweb.org/anthology/W16-3210/)