FlashAttention needs contiguous gradients #548

Merged
merged 2 commits into from
Nov 29, 2022


fmassa
Contributor

@fmassa fmassa commented Nov 28, 2022

This was not visible before because the test was creating tensors of the wrong dtype, which, when cast to the right dtype, would yield contiguous tensors.
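A minimal sketch of how a dtype cast can hide a non-contiguous gradient. It assumes the gradient reaches the backward pass as a broadcast (zero-stride) view, which is only one way such a gradient can arise and is not stated in this PR. The `prepare_grad` helper is hypothetical and is not the code changed here.

```python
import torch

# Assumption: the incoming gradient is a broadcast view with zero strides,
# e.g. what autograd may hand back after a reduction such as .sum().
right_dtype_grad = torch.ones(1, dtype=torch.float16).expand(2, 4)
wrong_dtype_grad = torch.ones(1, dtype=torch.float32).expand(2, 4)

# Same dtype: .to() returns the tensor unchanged, so it stays non-contiguous.
kept_view = right_dtype_grad.to(torch.float16)

# Different dtype: .to() must copy, and the copy of an overlapping view
# comes out contiguous, which masks the layout problem.
cast_copy = wrong_dtype_grad.to(torch.float16)


def prepare_grad(grad: torch.Tensor, dtype: torch.dtype) -> torch.Tensor:
    """Hypothetical helper: cast to the kernel dtype, then force a contiguous layout."""
    return grad.to(dtype).contiguous()


print(kept_view.is_contiguous(), cast_copy.is_contiguous())
print(prepare_grad(right_dtype_grad, torch.float16).is_contiguous())
```

Calling `.contiguous()` is a no-op when the tensor is already contiguous, so a guard of this kind only costs a copy in the case that would otherwise break.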

@facebook-github-bot facebook-github-bot added the CLA Signed label Nov 28, 2022
Contributor

@danthe3rd danthe3rd left a comment


Thanks a lot for spotting!

@fmassa fmassa merged commit c733c99 into main Nov 29, 2022
@fmassa fmassa deleted the fix-flash-grad branch November 29, 2022 10:36
3 participants