Minor blocksparse refactoring, update block size restrictions, relax power of two constraint #277

colehawkins · 2022-04-20T00:26:42Z

What does this PR do?

Minor blocksparse refactoring, update block size restrictions, relax power of two constraint.

Before submitting

- [x] Did you have fun?
  - Make sure you had fun coding 🙃
- [x] Did you read the contributor guideline?
- [ ] Was this discussed/approved via a GitHub issue? (no need for typos, doc improvements)
  - N/A
- [ ] Did you make sure to update the docs?
  - N/A
- [x] Did you write any new necessary tests?
  - N/A
- [ ] Did you update the changelog? (if needed)
  - N/A

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

blefaudeux

~~Looks good to me, if you can run black to fix the lint/format ? Else we can merge and I run that on the other branch ?~~

blefaudeux · 2022-04-20T02:55:31Z

tests/test_triton_blocksparse.py

 def test_attention_fwd_bwd(
    block,
    input_scale=1.0,
    scale=1 / 8.0,
-    n_ctx=256,
+    n_ctx=384,


Not too bad for the unit test time of execution? We're trying to keep these small on purpose

blefaudeux · 2022-04-20T02:55:54Z

tests/test_triton_blocksparse.py

    query, key, value = [x.clone() for x in qkvs]
    query.retain_grad()
    key.retain_grad()
    value.retain_grad()
-    if block not in [16, 32, 64]:
+    if block not in [16, 32, 64, 128]:


blefaudeux · 2022-04-20T02:56:57Z

xformers/components/attention/blocksparse.py

-                    self.block_size,
-                    device=q.device,
-                )
+                self.create_triton_kernels(q.device)


blefaudeux · 2022-04-20T02:57:34Z

xformers/components/attention/blocksparse.py

-                q.shape[-2], 2
-            ).is_integer(), (
-                "For now blocksparse only works on power-of-two sequence lengths"
+            assert (


Nice catch!
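
For context on the relaxed check: the removed assertion required `q.shape[-2]` to be a power of two, which the new test value `n_ctx=384` is not. Below is a minimal sketch of the difference. The hunk above is truncated, so the replacement condition is an assumption here: the sketch supposes the relaxed check only requires the sequence length to be a multiple of the block size. `is_power_of_two` and `is_multiple_of_block` are hypothetical helpers for illustration, not xformers functions.

```python
def is_power_of_two(n: int) -> bool:
    """Checks the property the removed log2-based assertion enforced,
    using integer arithmetic: exactly one bit set."""
    return n > 0 and (n & (n - 1)) == 0


def is_multiple_of_block(seq_len: int, block: int) -> bool:
    """ASSUMED relaxed constraint: sequence length divisible by block size."""
    return seq_len % block == 0


if __name__ == "__main__":
    # 256 is the old test n_ctx, 384 the new one; block sizes are those
    # listed in the updated test.
    for seq_len in (256, 384):
        blocks_ok = [b for b in (16, 32, 64, 128) if is_multiple_of_block(seq_len, b)]
        print(seq_len, is_power_of_two(seq_len), blocks_ok)
```

Under that assumption, 384 fails the old power-of-two test but is a multiple of every block size in the updated test list, which is why it exercises the relaxed path.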

blefaudeux · 2022-04-20T02:58:49Z

Thanks a bunch @colehawkins !

blefaudeux · 2022-04-20T03:10:58Z

Looks good to me, if you can run black to fix the lint/format ? Else we can merge and I run that on the other branch ?

Actually I can merge and format, don't bother

blefaudeux

Thanks again @colehawkins ! merging

…power of two constraint (#277)

* Relax device size restrictions
* Refactor device creation and run all tests
* linting

Co-authored-by: Cole Hawkins <colehawk@amazon.com>

@fmassa

* parent be72b26
  author Kashif Rasul <kashif.rasul@gmail.com> 1648069860 +0100
  committer Benjamin Lefaudeux <benjamin.lefaudeux@pm.me> 1650256563 -0700
  Move to Triton 2
  Author: Kashif Rasul <kashif.rasul@gmail.com>
  Co-authored-by: Benjamin Lefaudeux <benjamin.lefaudeux@pm.me>
  Tentatively fixing layernorm - faster all around - bugfix better take on sparse tensors, put layout on the correct device update the pip packages, minor cleanup
* catering for triton blocksparse being probably more reliable in fp16
* faster layernorm
* Minor blocksparse refactoring, update block size restrictions, relax power of two constraint (#277)
* Relax device size restrictions
* Refactor device creation and run all tests
* linting
  Co-authored-by: Cole Hawkins <colehawk@amazon.com>
* code review, thanks @fmassa !

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: colepshawkins <31542048+colehawkins@users.noreply.github.com>
Co-authored-by: Cole Hawkins <colehawk@amazon.com>

@fmassa

…h combo (#271)

* testing using conda to get the pytorch nightlies and matching cuda
* [fix] Making it explicit whether the attention mechanism supports an attention mask or not (#266) check the assert
* [backend] 3/3 Triton 2 update (#272)
* parent be72b26
  author Kashif Rasul <kashif.rasul@gmail.com> 1648069860 +0100
  committer Benjamin Lefaudeux <benjamin.lefaudeux@pm.me> 1650256563 -0700
  Move to Triton 2
  Author: Kashif Rasul <kashif.rasul@gmail.com>
  Co-authored-by: Benjamin Lefaudeux <benjamin.lefaudeux@pm.me>
  Tentatively fixing layernorm - faster all around - bugfix better take on sparse tensors, put layout on the correct device update the pip packages, minor cleanup
* catering for triton blocksparse being probably more reliable in fp16
* faster layernorm
* Minor blocksparse refactoring, update block size restrictions, relax power of two constraint (#277)
* Relax device size restrictions
* Refactor device creation and run all tests
* linting
  Co-authored-by: Cole Hawkins <colehawk@amazon.com>
* code review, thanks @fmassa !

Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: colepshawkins <31542048+colehawkins@users.noreply.github.com>
Co-authored-by: Cole Hawkins <colehawk@amazon.com>
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: colepshawkins <31542048+colehawkins@users.noreply.github.com>
Co-authored-by: Cole Hawkins <colehawk@amazon.com>

amazon-colehawk added 3 commits April 19, 2022 23:53

Relax device size restrictions

8e842ea

Refactor device creation and run all tests

963e39f

linting

8ea9742

facebook-github-bot added the CLA Signed label Apr 20, 2022

colehawkins mentioned this pull request Apr 20, 2022

[backend] 3/3 Triton 2 update #272

Merged

15 tasks

blefaudeux reviewed Apr 20, 2022

blefaudeux approved these changes Apr 20, 2022

blefaudeux merged commit b212063 into facebookresearch:triton-2 Apr 20, 2022
