Refactor rest of tinygemm quant primitive ops #321

jerryzh168 · 2024-06-04T18:41:06Z

Summary:
This PR replaces the remaining tinygemm specific quant primitive ops with the general quant primitive ops that we want to use for everything, we could delete these ops in a separate PR if needed

Test Plan:
python test/quantization/test_quant_primitives.py -k test_get_groupwise_affine_qparams python test/quantization/test_quant_primitives.py -k test_groupwise_affine_quantize_tensor_from_qparams python test/quantization/test_quant_primitives.py -k test_groupwise_affine_dequantize_tensor_from_qparams

accuracy:

perf:
no diff for generated code with TORCH_LOGS='output_code' python tutorials/quantize_vit/run_vit_b_quant.py

pytorch-bot · 2024-06-04T18:41:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/321

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f2db23d with merge base 08fb8bf ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: This PR replaces the remaining tinygemm specific quant primitive ops with the general quant primitive ops that we want to use for everything, we could delete these ops in a separate PR if needed Test Plan: python test/quantization/test_quant_primitives.py -k test_get_groupwise_affine_qparams python test/quantization/test_quant_primitives.py -k test_groupwise_affine_quantize_tensor_from_qparams python test/quantization/test_quant_primitives.py -k test_groupwise_affine_dequantize_tensor_from_qparams accuracy: perf: no diff for generated code with `TORCH_LOGS='output_code' python tutorials/quantize_vit/run_vit_b_quant.py`

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 4, 2024

jerryzh168 force-pushed the refactor-quant-primitives branch from f0dd811 to 4e1d9f4 Compare June 4, 2024 18:42

jerryzh168 requested review from HDCharles, cpuhrsch and msaroufim June 4, 2024 18:42

cpuhrsch approved these changes Jun 4, 2024

View reviewed changes

jerryzh168 force-pushed the refactor-quant-primitives branch from 4e1d9f4 to 33aa8e7 Compare June 4, 2024 18:57

jerryzh168 force-pushed the refactor-quant-primitives branch from 33aa8e7 to f2db23d Compare June 5, 2024 14:11

jerryzh168 merged commit 03e2c9b into pytorch:main Jun 5, 2024
13 checks passed

jerryzh168 deleted the refactor-quant-primitives branch June 5, 2024 15:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor rest of tinygemm quant primitive ops #321

Refactor rest of tinygemm quant primitive ops #321

jerryzh168 commented Jun 4, 2024

pytorch-bot bot commented Jun 4, 2024 •

edited

Loading

Refactor rest of tinygemm quant primitive ops #321

Refactor rest of tinygemm quant primitive ops #321

Conversation

jerryzh168 commented Jun 4, 2024

pytorch-bot bot commented Jun 4, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/321

✅ No Failures

pytorch-bot bot commented Jun 4, 2024 •

edited

Loading