[BUG] - Accelerating PyTorch Transformers by replacing nn.Transformer with Nested Tensors and torch.compile() #3176

AdrianOrenstein · 2024-12-06T05:18:42Z

Add Link

https://pytorch.org/tutorials/intermediate/transformer_building_blocks.html

Describe the bug

Unfinished sentence in the tutorial:

"Thanks to this PR this is no longer the case. Instead, fully masked rows in scaled_dot_product_attention [missing text]. For cases where nn.MHA does not employ the “fast-path”, this will also apply."

Describe your environment

Brave Browser.

mikaylagawarecki · 2024-12-19T20:02:36Z

Thanks for catching this

AdrianOrenstein added the bug label Dec 6, 2024

mikaylagawarecki self-assigned this Dec 19, 2024

mikaylagawarecki mentioned this issue Dec 23, 2024

Validate transformer tutorial builds against 2.6 #3202

Draft

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] - Accelerating PyTorch Transformers by replacing nn.Transformer with Nested Tensors and torch.compile() #3176

[BUG] - Accelerating PyTorch Transformers by replacing nn.Transformer with Nested Tensors and torch.compile() #3176

AdrianOrenstein commented Dec 6, 2024 •

edited

Loading

mikaylagawarecki commented Dec 19, 2024

[BUG] - Accelerating PyTorch Transformers by replacing nn.Transformer with Nested Tensors and torch.compile() #3176

[BUG] - Accelerating PyTorch Transformers by replacing nn.Transformer with Nested Tensors and torch.compile() #3176

Comments

AdrianOrenstein commented Dec 6, 2024 • edited Loading

Add Link

Describe the bug

Describe your environment

mikaylagawarecki commented Dec 19, 2024

AdrianOrenstein commented Dec 6, 2024 •

edited

Loading