-
Notifications
You must be signed in to change notification settings - Fork 634
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bwinline: Forceinline all functions of the bw #453
Conversation
[ghstack-poisoned]
ghstack-source-id: ad077080cb8cfef645610b2bfbac9def4ef6e179 Pull Request resolved: #453
Codecov ReportBase: 91.39% // Head: 91.39% // No change to project coverage 👍
Additional details and impacted files@@ Coverage Diff @@
## gh/danthe3rd/44/base #453 +/- ##
=====================================================
Coverage 91.39% 91.39%
=====================================================
Files 75 75
Lines 4417 4417
=====================================================
Hits 4037 4037
Misses 380 380
Flags with carried forward coverage won't be shown. Click here to find out more. Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. ☔ View full report at Codecov. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
head_dim = warp_uniform(head_dim); | ||
head_dim_value = warp_uniform(head_dim_value); | ||
num_queries = warp_uniform(num_queries); | ||
num_keys = warp_uniform(num_keys); | ||
|
||
query_ptr = warp_uniform(query_ptr); | ||
key_ptr = warp_uniform(key_ptr); | ||
value_ptr = warp_uniform(value_ptr); | ||
logsumexp_ptr = warp_uniform(logsumexp_ptr); | ||
output_ptr = warp_uniform(output_ptr); | ||
grad_output_ptr = warp_uniform(grad_output_ptr); | ||
delta_ptr = warp_uniform(delta_ptr); | ||
|
||
grad_query_ptr = warp_uniform(grad_query_ptr); | ||
grad_key_ptr = warp_uniform(grad_key_ptr); | ||
grad_value_ptr = warp_uniform(grad_value_ptr); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
note to self: this trick is needed otherwise there would be a performance slowdown
ghstack-source-id: ad077080cb8cfef645610b2bfbac9def4ef6e179 Pull Request resolved: #453
Stack from ghstack (oldest at bottom):