[Llama FA2] Re-add _expand_attention_mask and clean a couple things #27074
Conversation
@ArthurZucker could you give this a quick review? It would make the Bart FA PR much easier to continue and should also fix the BetterTransformer problem with optimum.
Of course!
```python
# Deprecated shim re-added for backward compatibility (downstream libraries
# such as optimum still import it from modeling_llama).
def _expand_mask(mask: torch.Tensor, dtype: torch.dtype, tgt_len: Optional[int] = None):
    warnings.warn(
        "Calling `transformers.models.llama.modeling_llama._expand_mask` is deprecated and will be "
        "removed in v4.37. Use `transformers.models.llama.modeling_llama.AttnMaskConverter._expand_mask`"
    )
    return AttnMaskConverter._expand_mask(mask=mask, dtype=dtype, tgt_len=tgt_len)
```
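For readers following along, here is a minimal, self-contained sketch of what this expansion conceptually does, based on the long-standing `_expand_mask` pattern in transformers (not necessarily the exact body of `AttnMaskConverter._expand_mask`): a 2D padding mask of shape `(batch, src_len)` becomes a 4D additive mask of shape `(batch, 1, tgt_len, src_len)` whose padded positions hold the dtype's most negative value.

```python
# Illustrative sketch only; not the exact implementation inside AttnMaskConverter.
from typing import Optional

import torch


def expand_mask_sketch(mask: torch.Tensor, dtype: torch.dtype, tgt_len: Optional[int] = None) -> torch.Tensor:
    bsz, src_len = mask.size()
    tgt_len = tgt_len if tgt_len is not None else src_len
    # Broadcast (bsz, src_len) -> (bsz, 1, tgt_len, src_len).
    expanded = mask[:, None, None, :].expand(bsz, 1, tgt_len, src_len).to(dtype)
    # Turn 1/0 "keep/pad" entries into an additive mask: 0 where attended,
    # the most negative representable value where padded.
    inverted = 1.0 - expanded
    return inverted.masked_fill(inverted.to(torch.bool), torch.finfo(dtype).min)


mask = torch.tensor([[1, 1, 0]])
print(expand_mask_sketch(mask, torch.float32).shape)  # torch.Size([1, 1, 3, 3])
```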
Nice! We should probably do the same for falcon and mistral as well
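For reference, the same treatment for Falcon (and analogously Mistral) could look roughly like the sketch below. This is a hypothetical mirror of the Llama shim above, not the actual Falcon code; the deprecation message and the `AttentionMaskConverter` import path are assumptions (in this PR's diff the class appears as `AttnMaskConverter` inside modeling_llama).

```python
# Hypothetical sketch of an equivalent shim for modeling_falcon.py, mirroring
# the Llama one above. The import path and message wording are assumptions.
import warnings
from typing import Optional

import torch

from transformers.modeling_attn_mask_utils import AttentionMaskConverter


def _expand_mask(mask: torch.Tensor, dtype: torch.dtype, tgt_len: Optional[int] = None):
    warnings.warn(
        "Calling `_expand_mask` directly is deprecated and will be removed in a future version. "
        "Use the attention mask converter instead."
    )
    return AttentionMaskConverter._expand_mask(mask=mask, dtype=dtype, tgt_len=tgt_len)
```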
I think in optimum only the llama mask utils are imported: https://github.com/huggingface/optimum/blob/313e1bd0de2b44aaa71797464f1e8b6a041a6f18/optimum/bettertransformer/models/attention.py#L25
ok 👍🏻
[Llama FA2] Re-add _expand_attention_mask and clean a couple things (huggingface#27074)
* clean
* clean llama
* fix more
* make style
* Apply suggestions from code review
* Apply suggestions from code review
* Update src/transformers/models/llama/modeling_llama.py
* Update src/transformers/models/llama/modeling_llama.py
* Apply suggestions from code review
* finish
* make style
What does this PR do?
This PR cleans up the attention mask converter a bit more, corrects some docstrings, removes outdated comments, and deprecates `_expand_attention_mask` (rather than removing it outright) to fix optimum.
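For context, keeping the deprecated name around means downstream imports, such as optimum's BetterTransformer code linked above, keep resolving instead of raising an `ImportError`. A hypothetical illustration of that kind of usage (the exact symbols optimum imports may differ, and this assumes a transformers version where the alias still exists, i.e. before its planned removal in v4.37):

```python
# Hypothetical downstream usage the shim protects; symbol names are illustrative
# and this only works on transformers versions that still ship the alias.
import torch
from transformers.models.llama.modeling_llama import _expand_mask

padding_mask = torch.tensor([[1, 1, 1, 0]])           # (batch, src_len), 1 = attend, 0 = pad
additive = _expand_mask(padding_mask, torch.float32)  # emits the deprecation warning above
print(additive.shape)                                 # torch.Size([1, 1, 4, 4])
```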