
[Fix] attn_dropout #123

Merged: blefaudeux merged 5 commits into main from blocksparse_dropout_fix on Nov 29, 2021

Conversation

@blefaudeux (Contributor)

What does this PR do?

Fixes #122 + adds a unit test to catch faulty dropouts
@fmassa is rewriting some of that part, but in the meantime main should not be knowingly broken
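
For context, a regression like this can be caught by comparing repeated forward passes. Below is a minimal sketch of such a test, not the exact one added in this PR; the tensor shapes and the `attention_module` argument are illustrative:

```python
import torch

def check_dropout_is_applied(attention_module: torch.nn.Module):
    # Illustrative shapes: (batch, sequence, embedding)
    q = k = v = torch.rand(2, 16, 32)

    # With a non-zero dropout probability, two passes in train mode
    # should produce different outputs
    attention_module.train()
    out_a = attention_module(q, k, v)
    out_b = attention_module(q, k, v)
    assert not torch.allclose(out_a, out_b), "dropout is being silently ignored"

    # Dropout is disabled in eval mode, so repeated passes must match
    attention_module.eval()
    assert torch.allclose(attention_module(q, k, v), attention_module(q, k, v))
```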

Before submitting

  • Did you have fun?
    • Make sure you had fun coding 🙃
  • Did you read the contributor guideline?
  • Was this discussed/approved via a GitHub issue? (no need for typos, doc improvements)
    • N/A
  • Did you make sure to update the docs?
    • N/A
  • Did you write any new necessary tests?
    • N/A
  • Did you update the changelog? (if needed)
    • N/A

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

@facebook-github-bot added the CLA Signed label on Nov 28, 2021
@blefaudeux changed the title from "[blocksparse] Fix dropout" to "[DRAFT][blocksparse] Fix dropout" on Nov 28, 2021
@blefaudeux marked this pull request as draft on November 29, 2021 01:40
@blefaudeux (Contributor, Author)

@fmassa @dianaml0 turns out that in many cases dropout was not applied, not just for blocksparse; a unit test is catching that now :(

@dianaml0 (Contributor)

> @fmassa @dianaml0 turns out that in many cases dropout was not applied, not just for blocksparse; a unit test is catching that now :(

Oh wow :( Great that you were able to catch that!

@blefaudeux (Contributor, Author)

> > @fmassa @dianaml0 turns out that in many cases dropout was not applied, not just for blocksparse; a unit test is catching that now :(
>
> Oh wow :( Great that you were able to catch that!

Maybe a semi-recent change; I think @fmassa is right that we should refactor core/. Otherwise, for some variants the setting was silently ignored (not explicitly present in the constructor), so it would just fail for the factory; not as bad, but still... Unit tests for the win, this should not happen again.
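
To illustrate the failure mode being discussed, here is a hypothetical sketch (made-up class and parameter names, not xformers' actual code): a variant whose constructor does not declare `dropout` explicitly but accepts `**kwargs` will silently discard the setting instead of failing loudly:

```python
import torch.nn as nn

class SomeAttentionVariant(nn.Module):
    # Hypothetical: `dropout` is not an explicit argument, so a caller
    # passing `dropout=0.1` sees it swallowed by **kwargs and never applied
    def __init__(self, dim: int, **kwargs):
        super().__init__()
        self.proj = nn.Linear(dim, dim)

    def forward(self, x):
        # No nn.Dropout anywhere: the requested dropout is silently a no-op
        return self.proj(x)

# This "succeeds" while ignoring the dropout setting entirely
layer = SomeAttentionVariant(dim=32, dropout=0.1)
```

A unit test exercising train/eval behaviour, like the one added in this PR, is what surfaces this class of bug.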

@blefaudeux changed the title from "[DRAFT][blocksparse] Fix dropout" to "[DRAFT] Fix attn_dropout" on Nov 29, 2021
@dianaml0 (Contributor) left a comment

Really nice catch!

xformers/components/attention/core.py (review thread, outdated, resolved)
```diff
- _ = multi_head(inputs, inputs_shuffled, inputs)
+ att = multi_head(inputs, inputs_shuffled, inputs)

  # Check that dropout actually drops some values
```

Great to have this test now!
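
For intuition on what the "# Check that dropout actually drops some values" assertion is about, here is a minimal, self-contained sketch in plain PyTorch (not the PR's test code): in train mode dropout zeroes a fraction of values and is stochastic across calls, while in eval mode it is a no-op:

```python
import torch

torch.manual_seed(0)
drop = torch.nn.Dropout(p=0.5)
x = torch.rand(4, 8)

drop.train()
y1, y2 = drop(x), drop(x)
assert (y1 == 0).any()          # some values were actually dropped
assert not torch.equal(y1, y2)  # stochastic across calls

drop.eval()
assert torch.equal(drop(x), x)  # identity in eval mode
```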

xformers/components/attention/linformer.py (review thread, outdated, resolved)
@codecov-commenter commented Nov 29, 2021

Codecov Report

Merging #123 (19b802e) into main (71bab94) will increase coverage by 0.04%.
The diff coverage is 100.00%.

```diff
@@            Coverage Diff             @@
##             main     #123      +/-   ##
==========================================
+ Coverage   87.56%   87.61%   +0.04%
==========================================
  Files          50       50
  Lines        2558     2567       +9
==========================================
+ Hits         2240     2249       +9
  Misses        318      318
```
```
Flag     Coverage Δ
Python   87.61% <100.00%> (+0.04%) ⬆️
```

Flags with carried forward coverage won't be shown.

```
Impacted Files                                          Coverage Δ
...formers/components/attention/scaled_dot_product.py   94.73% <ø> (ø)
xformers/components/attention/blocksparse.py            94.11% <100.00%> (ø)
xformers/components/attention/core.py                   92.12% <100.00%> (+0.06%) ⬆️
xformers/components/attention/fourier_mix.py            100.00% <100.00%> (ø)
xformers/components/attention/global_tokens.py          100.00% <100.00%> (ø)
xformers/components/attention/lambda_layer.py           100.00% <100.00%> (ø)
xformers/components/attention/linformer.py              100.00% <100.00%> (ø)
```

Continue to review full report at Codecov.

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 71bab94...19b802e.

@blefaudeux marked this pull request as ready for review on November 29, 2021 02:54
@blefaudeux changed the title from "[DRAFT] Fix attn_dropout" to "[Fix] attn_dropout" on Nov 29, 2021
@blefaudeux merged commit 861493c into main on Nov 29, 2021
@blefaudeux deleted the blocksparse_dropout_fix branch on November 29, 2021 04:13
xwhan pushed a commit to xwhan/xformers that referenced this pull request on Feb 8, 2022: "Very small update to doc, command for benchmarking"
Labels
CLA Signed: This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

BlockSparseAttention doesn't use dropout
4 participants