Fix AMD FP8 Test and use native rowwise quantization in benchmark #2849
Conversation
✅ Deploy Preview for pytorch-fbgemm-docs ready!
This pull request was exported from Phabricator. Differential Revision: D59771162
Force-pushed the branch from 37ba9ec to dd051db
Force-pushed the branch from dd051db to 2cbdfbe
Force-pushed the branch from 2cbdfbe to 75ff00a
This pull request has been merged in 57a5969.
Summary:
Fix a minor test issue where Triton blockwise quantization was being run on AMD even though it is not supported there (a sketch of this kind of skip guard is shown below).
I also switch rowwise quantization in our FP8 benchmarks to the native HIP implementation (a reference sketch of rowwise FP8 quantization follows below).
Reviewed By: jianyuh
Differential Revision: D59771162
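
A minimal sketch of the kind of skip guard the test fix describes, assuming a unittest-style test and using torch.version.hip to detect ROCm (AMD) builds. The class, method, and helper names here are placeholders, not the actual FBGEMM test identifiers.

```python
import unittest

import torch


def running_on_rocm() -> bool:
    # torch.version.hip is set only in ROCm (AMD) builds of PyTorch.
    return torch.version.hip is not None


class Fp8QuantizeTest(unittest.TestCase):
    @unittest.skipIf(
        running_on_rocm(),
        "Triton blockwise FP8 quantization is not supported on AMD/ROCm",
    )
    def test_blockwise_quantize(self) -> None:
        # Placeholder body: the real test exercises the Triton blockwise
        # quantization kernel, which is CUDA-only at the time of this PR.
        x = torch.randn(128, 256, device="cuda")
        self.assertEqual(x.shape, (128, 256))


if __name__ == "__main__":
    unittest.main()
```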
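And a reference sketch of what rowwise FP8 quantization computes, to illustrate what the benchmark switches to. The native HIP kernel in FBGEMM is a fused device implementation; this pure-PyTorch version is only a functional stand-in, and the function names and FP8 dtype choice below are assumptions, not FBGEMM APIs.

```python
import torch

FP8_DTYPE = torch.float8_e4m3fn  # AMD builds typically use float8_e4m3fnuz
FP8_MAX = torch.finfo(FP8_DTYPE).max


def quantize_fp8_rowwise(x: torch.Tensor, eps: float = 1e-12):
    """Quantize a 2D tensor row by row, producing one scale per row."""
    # The per-row max magnitude sets the scale so each row spans the FP8 range.
    row_max = x.abs().amax(dim=1, keepdim=True).clamp_min(eps)
    scale = row_max / FP8_MAX
    xq = (x / scale).clamp(-FP8_MAX, FP8_MAX).to(FP8_DTYPE)
    # Return quantized values plus the per-row scales needed to dequantize.
    return xq, scale.squeeze(1)


def dequantize_fp8_rowwise(xq: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    return xq.to(torch.float32) * scale.unsqueeze(1)


if __name__ == "__main__":
    x = torch.randn(4, 8)
    xq, scale = quantize_fp8_rowwise(x)
    x_hat = dequantize_fp8_rowwise(xq, scale)
    print((x - x_hat).abs().max())
```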