MultiHeadAttention Layer #1062

Merged: 25 commits into tensorflow:master, Mar 10, 2020

Conversation

@cgarciae (Contributor) commented Feb 10, 2020:

Implementation of MultiHeadAttention as presented in "Attention Is All You Need" and discussed in #951. Uses tf.einsum to generalize dot-product attention to multiple heads.

Missing:

  • Documentation
  • Tests
  • Parameters for weight initializers, regularizers, constraints, etc.
  • config method

References: "Attention Is All You Need" (Vaswani et al., 2017) and issue #951.
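(Editorial aside, not the code in this PR: a minimal sketch of how tf.einsum generalizes dot-product attention to multiple heads. Shapes, names, and the ad-hoc weight creation below are illustrative assumptions, not the layer's actual API.)

    import tensorflow as tf

    def multi_head_attention(query, key, value, num_heads, head_dim):
        # query: [batch, n, d_model]; key/value: [batch, m, d_model]
        d_model = query.shape[-1]
        init = tf.keras.initializers.GlorotUniform()
        wq = tf.Variable(init([d_model, num_heads, head_dim]))
        wk = tf.Variable(init([d_model, num_heads, head_dim]))
        wv = tf.Variable(init([d_model, num_heads, head_dim]))
        wo = tf.Variable(init([num_heads, head_dim, d_model]))

        # Project inputs into per-head subspaces: [batch, len, heads, head_dim]
        q = tf.einsum("bnd,dhk->bnhk", query, wq)
        k = tf.einsum("bmd,dhk->bmhk", key, wk)
        v = tf.einsum("bmd,dhk->bmhk", value, wv)

        # Scaled dot-product logits for every head at once: [batch, heads, n, m]
        logits = tf.einsum("bnhk,bmhk->bhnm", q, k) / tf.sqrt(float(head_dim))
        weights = tf.nn.softmax(logits, axis=-1)

        # Weighted sum of values, then project back to d_model: [batch, n, d_model]
        context = tf.einsum("bhnm,bmhk->bnhk", weights, v)
        return tf.einsum("bnhk,hkd->bnd", context, wo)

    # Toy self-attention call: output has shape [2, 5, 16]
    x = tf.random.normal([2, 5, 16])
    out = multi_head_attention(x, x, x, num_heads=4, head_dim=8)

Each einsum handles all heads in a single call, replacing the usual reshape/transpose steps of single-head attention, which is what "generalize dot-product attention to multiple heads" refers to in the description above.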

@guillaumekln (Contributor) left a comment:

Thanks for the PR! I have some interest in this, so here are some comments/questions:

(Review comments on tensorflow_addons/layers/multihead_attention.py: outdated, resolved)
@cgarciae (Contributor, Author) commented:

@guillaumekln thanks for the comments! I've updated the code to address some of them.

@cgarciae (Contributor, Author) commented Feb 25, 2020:

@guillaumekln @AakashKumarNain @facaiy @seanpmorgan

Code should be ready for review :) Only 2 things are missing:

  • Finish docstring
  • Fix GitHub CI issues

@cgarciae (Contributor, Author) commented Feb 25, 2020:

Does anyone know what is wrong with flake8? I am not getting any errors from flake8 locally.

@cgarciae changed the title from "[WIP] MultiHeadAttention Layer" to "MultiHeadAttention Layer" on Feb 25, 2020
Review comment on tensorflow_addons/layers/multihead_attention.py (constructor signature excerpt):

    bias_initializer: typing.Union[str, typing.Callable] = "zeros",
    bias_regularizer: typing.Union[str, typing.Callable] = None,
    bias_constraint: typing.Union[str, typing.Callable] = None,
    **kwargs,
@ulf1 (Contributor) commented:

My guess is that the comma after **kwargs will cause E999 SyntaxError: invalid syntax in the flake8 test. You can run flake8 tensorflow_addons/layers/multihead_attention.py directly to check it out

@cgarciae (Contributor, Author) commented:

@ulf1 I see, thanks! black is automatically adding that comma :( I'll disable "format on save" to remove it.
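(Editorial note for readers hitting the same failure; the class and defaults below are illustrative, built around the excerpt above rather than quoted from the PR's file. Black's "magic trailing comma" appends a comma after the last argument when it splits a signature across lines, including after **kwargs. A trailing comma after **kwargs in a def is only accepted from Python 3.6 onward, so a flake8 run under an older interpreter fails while parsing the file and reports E999 instead of an ordinary style warning.)

    class Example:
        def __init__(
            self,
            bias_initializer="zeros",
            bias_regularizer=None,
            bias_constraint=None,
            **kwargs,  # Black adds this comma; pre-3.6 parsers reject it, hence flake8 E999
        ):
            self.bias_initializer = bias_initializer
            self.bias_regularizer = bias_regularizer
            self.bias_constraint = bias_constraint
            self.extra_kwargs = kwargs

Dropping the comma after **kwargs, as done here, keeps the file parseable on every interpreter flake8 might run under.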

@gabrieldemarmiesse (Member) commented:

I've merged master into your branch to update it and fixed any formatting/conflicts it might have. If you need to do some more modifications, please do git pull beforehand.

@cgarciae (Contributor, Author) commented:

The pre-commit.sh script modified a bunch of files unrelated to this PR, possibly the ones added during the merge by @gabrieldemarmiesse.

@gabrieldemarmiesse (Member) commented:

I'll look into it and push the fix to your branch. Thanks for the heads up :)

@cgarciae requested review from qlzh727 and a team as code owners, February 26, 2020 19:51
@cgarciae (Contributor, Author) commented:

@Squadrick I added a small commit; I don't know if that kicked off the Kokoro checks.

@qlzh727 (Member) left a comment:

Thanks for the change; more comments about the unit test.

(Review comments on tensorflow_addons/layers/multihead_attention.py and tensorflow_addons/layers/multihead_attention_test.py: outdated, resolved)
@cgarciae (Contributor, Author) commented Mar 3, 2020:

@qlzh727 the changes you requested were made.

@seanpmorgan (Member) left a comment:

Almost LGTM. Also, will you be willing to maintain this going forward? If so, please add it to the CODEOWNERS file.

(Review comment on tensorflow_addons/layers/__init__.py: outdated, resolved)
@boring-cyborg (bot) added the github label Mar 9, 2020
@cgarciae (Contributor, Author) commented Mar 9, 2020:

@seanpmorgan Yeah, happy to maintain it. Added entry to CODEOWNERS.
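(Editorial note: a CODEOWNERS entry is just a path pattern followed by the owning GitHub handle(s). The paths below are assumptions based on the files touched in this PR, not a quote from the repository's actual CODEOWNERS file.)

    /tensorflow_addons/layers/multihead_attention.py @cgarciae
    /tensorflow_addons/layers/multihead_attention_test.py @cgarciae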

@seanpmorgan (Member) commented:

@cgarciae Sorry for the conflicts. Could you resolve them? Then LGTM.

@cgarciae (Contributor, Author) commented Mar 9, 2020:

@seanpmorgan no problem. Conflicts solved!

@seanpmorgan (Member) left a comment:

LGTM, thanks for this great contribution! Will leave the PR open for another day in case any of the other reviewers have issues.

@seanpmorgan (Member) commented:

@qlzh727 please let us know if the changes you requested are sufficient. I believe they were addressed.

@qlzh727 (Member) left a comment:

LGTM.

@seanpmorgan merged commit 3b0d978 into tensorflow:master Mar 10, 2020
jrruijli pushed a commit to jrruijli/addons that referenced this pull request Dec 23, 2020
* Add MultiHeadAttention Layer