[fix] Layer positions labelling & layernorm #348
Conversation
@@ -149,10 +149,12 @@ def __init__(
    for i in range(config.num_layers):
        # Label where this layer is in the stack
        # (for instance useful for the positional encoding, or late layer norm)
        if i > 0:
this would count within the repeated layers, not the overall layer stack
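The distinction matters: when a block of layers is repeated, an index taken inside the repeated block resets on every repeat, so `is_first()`/`is_last()` checks fire multiple times. A minimal sketch of the fix's idea, labelling positions against the overall stack (the names `LayerPosition`, `label_layers`, `layers_per_block`, and `num_repeats` are illustrative, not the actual xformers API):

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LayerPosition:
    index: int  # absolute index in the full stack, across all repeats
    total: int  # total number of layers in the stack

    def is_first(self) -> bool:
        return self.index == 0

    def is_last(self) -> bool:
        return self.index == self.total - 1


def label_layers(layers_per_block: int, num_repeats: int) -> List[LayerPosition]:
    # Count over the whole stack, not within one repeated block,
    # so is_first()/is_last() are each true exactly once.
    total = layers_per_block * num_repeats
    return [LayerPosition(i, total) for i in range(total)]
```

With `layers_per_block=2, num_repeats=3`, only `positions[0].is_first()` and `positions[5].is_last()` hold; a per-block counter would instead report three "first" layers.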
Force-pushed from e5f22e4 to ebc4f6f
Force-pushed from 67ba72f to f8f4972
@@ -175,10 +175,7 @@ def __init__(self, config: xFormerEncoderConfig, **kwargs):
    # Optional patch embedding
    self.patch_emb: Optional[nn.Module] = None

    if (
this was not respecting the config while trying to do the right thing: if the config asked for a patch embedding but the layer was not first, the embedding would not be instantiated. In retrospect I think that's risky, since the code does not do what the API says it will do, and it only worked in practice because is_first() was often wrong. I now think it's better to respect the config no matter what, and not silently diverge
Codecov Report
@@ Coverage Diff @@
## hierachical_models_improvement #348 +/- ##
===============================================================
Coverage 93.95% 93.95%
===============================================================
Files 70 70
Lines 3984 3984
===============================================================
Hits 3743 3743
Misses 241 241
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
Nice catch!
* handling different normalizations + layer repetition
* bugfix localizing the layers in the stack (#348)
* renaming the layer_norm_style param when building from config

Co-authored-by: Benjamin Lefaudeux <lefaudeux@Benjamins-MacBook-Pro.local>
What does this PR do?
Fixes #347
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.