Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve build time by ~30% #539

Merged
merged 7 commits into from
Nov 29, 2022
Merged

Conversation

danthe3rd
Copy link
Contributor

@danthe3rd danthe3rd commented Nov 24, 2022

Stack from ghstack (oldest at bottom):

... by reducing the number of ATen imports, and skipping them altogether when building the actual kernels

13mn -> 9mn on Sm61 (CI, does not build flash)

[ghstack-poisoned]
danthe3rd pushed a commit that referenced this pull request Nov 24, 2022
ghstack-source-id: 504b6d0a01cf4df9daa82264544e01da466f83a3
Pull Request resolved: #539
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 24, 2022
[ghstack-poisoned]
danthe3rd pushed a commit that referenced this pull request Nov 24, 2022
ghstack-source-id: 4b2445fe3c83eef3282643862ed83cef85dc5997
Pull Request resolved: #539
[ghstack-poisoned]
danthe3rd pushed a commit that referenced this pull request Nov 24, 2022
ghstack-source-id: eee2e10144ca54bb23e34d6ff5988faa3308c08d
Pull Request resolved: #539
[ghstack-poisoned]
danthe3rd pushed a commit that referenced this pull request Nov 24, 2022
ghstack-source-id: bb993fcd50536e112fcb478e7128d76da3b1f195
Pull Request resolved: #539
@codecov-commenter
Copy link

codecov-commenter commented Nov 24, 2022

Codecov Report

Base: 89.79% // Head: 89.79% // No change to project coverage 👍

Coverage data is based on head (dfd494a) compared to base (dfd494a).
Patch has no changes to coverable lines.

❗ Current head dfd494a differs from pull request most recent head c7f5164. Consider uploading reports for the commit c7f5164 to get more accurate results

Additional details and impacted files
@@                  Coverage Diff                  @@
##           gh/danthe3rd/56/base     #539   +/-   ##
=====================================================
  Coverage                 89.79%   89.79%           
=====================================================
  Files                        80       80           
  Lines                      4839     4839           
=====================================================
  Hits                       4345     4345           
  Misses                      494      494           
Flag Coverage Δ
Python 89.79% <0.00%> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

@danthe3rd danthe3rd marked this pull request as ready for review November 24, 2022 22:03
@danthe3rd danthe3rd changed the title Improve build time Improve build time by ~30% Nov 24, 2022
... by reducing the number of ATen imports, and skipping them altogether when building the actual kernels

13mn -> 9mn on Sm61 (CI, does not build flash)

[ghstack-poisoned]
Copy link
Contributor

@fmassa fmassa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks!

@@ -384,7 +384,7 @@ class MmaPipelinedFromSharedMemory : public MmaBaseFromSharedMemory<
// but not supported as it worsens perf: older gpus < sm80 don't
// support async tranfers and have to waste registers
CUTLASS_DEVICE
bool set_prologue_done(bool value) {}
void set_prologue_done(bool value) {}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Those changes are unrelated, right?

@@ -676,18 +671,19 @@ struct AttentionBackwardKernel {
}
};

static void __host__ check_supported(Params const& p) {
static bool __host__ check_supported(Params const& p) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This change is also unrelated, right?

... by reducing the number of ATen imports, and skipping them altogether when building the actual kernels

13mn -> 9mn on Sm61 (CI, does not build flash)

[ghstack-poisoned]
... by reducing the number of ATen imports, and skipping them altogether when building the actual kernels

13mn -> 9mn on Sm61 (CI, does not build flash)

[ghstack-poisoned]
@danthe3rd danthe3rd merged commit 928b0df into gh/danthe3rd/56/base Nov 29, 2022
danthe3rd pushed a commit that referenced this pull request Nov 29, 2022
ghstack-source-id: a083a9494486298191eea001ff480a82af6966c7
Pull Request resolved: #539
@danthe3rd danthe3rd deleted the gh/danthe3rd/56/head branch November 29, 2022 17:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants