-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor tuning script #190
Conversation
Benchmark results for commit 158ce73 (comparing to 480924c):
Comparison with baseline
|
Timings are much more sensitive to throttling, so wait for the device to become idle and just take the minimal time measurement.
13200b5
to
373073e
Compare
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #190 +/- ##
=======================================
Coverage 32.04% 32.04%
=======================================
Files 11 11
Lines 958 958
=======================================
Hits 307 307
Misses 651 651 ☔ View full report in Codecov by Sentry. |
The goal is to make the script more resilient, i.e., automatically restarting workers if they fail. It should also be much faster, as I switched from doing relatively brute-force measurements, to waiting for the device to be unthrottled and doing only few measurements. Finally, I've also split off the WMMA/GEMM-specific parts so that it should be easier to add support for e.g. FPU/GEMM or TC tuning.