Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Use
Int8DynActInt4WeightQuantizer
in torchao (#2551)
Summary: Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #2551 att bypass-github-export-checks bypass-github-pytorch-ci-checks bypass-github-executorch-ci-checks Pull Request resolved: #2551 Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewed By: andrewor14 Differential Revision: D55221981 Pulled By: jerryzh168 fbshipit-source-id: 8f59df461416d5a5bbdc53f3a6baba331ee3f4ce
- Loading branch information