Benchmarking updates for semi-structured sparse training #398

jcaip · 2024-06-18T23:35:21Z

Summary:

This PR does the following:

adds e2e ViT benchmarks for semi-structured sparse training
adds nn.Linear microbenchmarks
removes extra xformers benchmarking utils I copied over
removes MLP block benchmarks
updated README.md with new benchmarks + accuracy benchmarks

Given we have nn.Linear microbenchmarks and e2e benchmarks, I felt that
the MLP block benchmarks were unnecessary

As a sanity check, I ran the MLP benchmarks with the new benchmarking
suite and the old one, and got the same results:

NEW:

OLD:

	dense	w24
f16 (44160,1024,4096,1024)	11534.3	9204.7

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: This PR does the following: - adds e2e ViT benchmarks for semi-structured sparse training - adds nn.Linear microbenchmarks - removes extra xformers benchmarking utils I copied over - removes MLP block benchmarks - updated README.md with new benchmarks + accuracy benchmarks Given we have nn.Linear microbenchmarks and e2e benchmarks, I felt that the MLP block benchmarks were unnecessary As a sanity check, I ran the MLP benchmarks with the new benchmarking suite and the old one, and got the same results: Test Plan: Reviewers: Subscribers: Tasks: Tags:

pytorch-bot · 2024-06-18T23:35:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/398

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 881ae2c with merge base 6b0ca2d ():

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Run Regression Tests / test (CUDA Nightly, linux.g5.12xlarge.nvidia.gpu, --pre torch --index-url https://download.pytorc... / linux-job (gh) (trunk failure)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

msaroufim

Cool thank you! This is significantly clearer. I do want us to think a bit harder about the top line metric since 6% might not be super compelling to people not familiar with limitations of sparsity

jcaip · 2024-06-20T15:43:18Z

@msaroufim We could compare to masking based approaches (which are slower than dense training) for a larger number, but I think it'd be a bit confusing since I'm assuming most users are coming with a dense model and not an existing sparse training script they want to accelerate.

* Benchmarking updates for semi-structured sparse training Summary: This PR does the following: - adds e2e ViT benchmarks for semi-structured sparse training - adds nn.Linear microbenchmarks - removes extra xformers benchmarking utils I copied over - removes MLP block benchmarks - updated README.md with new benchmarks + accuracy benchmarks Given we have nn.Linear microbenchmarks and e2e benchmarks, I felt that the MLP block benchmarks were unnecessary As a sanity check, I ran the MLP benchmarks with the new benchmarking suite and the old one, and got the same results: Test Plan: Reviewers: Subscribers: Tasks: Tags: * update * add units

jcaip added 2 commits June 18, 2024 16:17

update

395bf2d

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 18, 2024

add units

881ae2c

jcaip requested a review from msaroufim June 18, 2024 23:40

msaroufim approved these changes Jun 19, 2024

View reviewed changes

jcaip merged commit 5559405 into main Jun 20, 2024
12 of 13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmarking updates for semi-structured sparse training #398

Benchmarking updates for semi-structured sparse training #398

jcaip commented Jun 18, 2024 •

edited

Loading

pytorch-bot bot commented Jun 18, 2024 •

edited

Loading

msaroufim left a comment

jcaip commented Jun 20, 2024

Benchmarking updates for semi-structured sparse training #398

Benchmarking updates for semi-structured sparse training #398

Conversation

jcaip commented Jun 18, 2024 • edited Loading

pytorch-bot bot commented Jun 18, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/398

✅ You can merge normally! (1 Unrelated Failure)

msaroufim left a comment

Choose a reason for hiding this comment

jcaip commented Jun 20, 2024

jcaip commented Jun 18, 2024 •

edited

Loading

pytorch-bot bot commented Jun 18, 2024 •

edited

Loading