
Port new seq2seq tutorial #130

Closed
chsasank opened this issue Aug 30, 2017 · 8 comments · Fixed by #2452
Labels: docathon-h1-2023, medium, module: torchtext

Comments

@chsasank (Contributor) commented Aug 30, 2017

From https://github.com/spro/practical-pytorch/tree/master/seq2seq-translation

cc @pytorch/team-text-core @Nayef211

@chsasank chsasank self-assigned this Aug 30, 2017
@poweihuang17

By the way, which attention model is implemented in the official tutorial? It is said to implement the Bahdanau et al. model, but it actually does not. The code uses the input embedding and the decoder hidden state to calculate the attention weights, but the Bahdanau et al. model does not work like this.

@wzpfish commented Mar 12, 2019

> By the way, which attention model is implemented in the official tutorial? It is said to implement the Bahdanau et al. model, but it actually does not. The code uses the input embedding and the decoder hidden state to calculate the attention weights, but the Bahdanau et al. model does not work like this.

I think it's a bug. I've never seen attention in this form before...

@bbruceyuan

@wzpfish In my opinion, it's an incorrect implementation of MLP-based attention scoring. All of the encoder hidden states should be used.

@wzpfish commented Apr 1, 2019

@hey-bruce Look at the tutorial at https://pytorch.org/tutorials/intermediate/seq2seq_translation_tutorial.html

attn_weights = F.softmax(self.attn(torch.cat((embedded[0], hidden[0]), 1)), dim=1)

These attention weights do not use any of the encoder hidden states. I think something is wrong.
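
For comparison, here is a minimal sketch of Bahdanau-style additive attention, where the score for each source position depends on the decoder hidden state and that position's encoder output. The class name `AdditiveAttention` and the batch-first shapes are illustrative, not taken from the tutorial:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdditiveAttention(nn.Module):
    """Illustrative Bahdanau-style additive attention: scores depend on the decoder
    hidden state and every encoder output (annotation), not on the input embedding."""
    def __init__(self, hidden_size):
        super().__init__()
        self.Wa = nn.Linear(hidden_size, hidden_size)  # projects the decoder hidden state
        self.Ua = nn.Linear(hidden_size, hidden_size)  # projects the encoder outputs
        self.va = nn.Linear(hidden_size, 1)            # collapses each combined score to a scalar

    def forward(self, decoder_hidden, encoder_outputs):
        # decoder_hidden:  (batch, 1, hidden_size)
        # encoder_outputs: (batch, src_len, hidden_size)
        scores = self.va(torch.tanh(self.Wa(decoder_hidden) + self.Ua(encoder_outputs)))
        weights = F.softmax(scores.squeeze(2).unsqueeze(1), dim=-1)  # (batch, 1, src_len)
        context = torch.bmm(weights, encoder_outputs)                # (batch, 1, hidden_size)
        return context, weights
```

The point is that `encoder_outputs` enters the score computation, so the weights can actually vary with the source sentence; the tutorial line above only looks at the current input embedding and the previous hidden state.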

@rajarsheem

@wzpfish Exactly. I raised an issue on their discussion forum: https://discuss.pytorch.org/t/possible-bug-in-seq2seq-tutorial/48241

Shortly afterwards, I saw your comment while searching here to check whether someone had already raised a similar issue.

@karkirowle commented Mar 25, 2020

I was just studying this and came to the same realisation. The annotations, i.e. the encoder outputs, should be used instead of the embedded output words. It's not a one-line change, I think.
Also, Bahdanau uses a bidirectional recurrent neural network (see the sketch below).
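
A rough sketch of that part, assuming a GRU encoder as in the tutorial (the `BiEncoder` name is illustrative, not the tutorial's class): each annotation concatenates the forward and backward states, so the attention layer would have to accept `2 * hidden_size` inputs on the encoder side.

```python
import torch
import torch.nn as nn

class BiEncoder(nn.Module):
    """Illustrative bidirectional GRU encoder producing Bahdanau-style annotations."""
    def __init__(self, vocab_size, hidden_size):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, hidden_size)
        self.gru = nn.GRU(hidden_size, hidden_size, batch_first=True, bidirectional=True)

    def forward(self, src):
        # src: (batch, src_len) of token indices
        embedded = self.embedding(src)            # (batch, src_len, hidden_size)
        annotations, hidden = self.gru(embedded)  # annotations: (batch, src_len, 2 * hidden_size)
        return annotations, hidden
```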

@karkirowle

I can make a pull request stating something like "this is being reworked", but the tutorial is not appropriate in its current form and really needs to be changed as soon as possible. You can even see that the learnt attention weights are wrong. I can start working on an updated version of this tutorial eventually if needed.

@QasimKhan5x (Contributor)

/assigntome
