[Question] How do we get act_scales for a custom llama-like model? How many calibration data items do we need? Do we also need act_zeros? #10

Open
hanrui1sensetime opened this issue Dec 7, 2023 · 3 comments

Comments

@hanrui1sensetime

We want to try QUIK on our self-implemented llama-like model weights, but we could not find a script for generating the act_scales .pt files.
Should we run calibration data through the model to quantize the activations and save the scales first? How many calibration items should we use, and do we need act_zeros as well?

Looking forward to your reply.
Thanks.

@hanrui1sensetime
Author

I have found the script in the SmoothQuant repo, so I will close the issue.

@VityaVitalich

I have found the script in the SmoothQuant repo, so I will close the issue.

Could you please share how the problem was solved?

@VityaVitalich

Found it quite easily. For anyone else interested, the code for generating the scales can be found at the link below. All the needed information is described in the README.

https://github.com/mit-han-lab/smoothquant/tree/main
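
For readers who want a rough idea of what "generating act_scales" involves before opening that repo, here is a minimal sketch. It assumes the scales are a running per-channel maximum of absolute activation values taken over the calibration batches, which is my understanding of the approach and not something confirmed in this thread. The layer name `layers.0.q_proj`, the helper names, and the toy data are all hypothetical, and plain Python lists stand in for tensors so that the snippet runs without any dependencies.

```python
from typing import Dict, Iterable, List

# One batch of activations: rows are tokens, columns are hidden channels.
Batch = List[List[float]]


def update_act_scales(scales: Dict[str, List[float]], name: str, batch: Batch) -> None:
    """Fold one batch into the running per-channel abs-max for layer `name`."""
    num_channels = len(batch[0])
    batch_max = [max(abs(row[c]) for row in batch) for c in range(num_channels)]
    if name not in scales:
        scales[name] = batch_max
    else:
        scales[name] = [max(old, new) for old, new in zip(scales[name], batch_max)]


def collect_act_scales(batches_per_layer: Dict[str, Iterable[Batch]]) -> Dict[str, List[float]]:
    """Return {layer name: per-channel abs-max} over all calibration batches."""
    scales: Dict[str, List[float]] = {}
    for name, batches in batches_per_layer.items():
        for batch in batches:
            update_act_scales(scales, name, batch)
    return scales


# Toy calibration data (hypothetical): two batches for a single layer input.
calib = {
    "layers.0.q_proj": [
        [[0.5, -2.0], [1.5, 0.25]],  # batch 1: two tokens, two channels
        [[-3.0, 1.0]],               # batch 2: one token, two channels
    ]
}
act_scales = collect_act_scales(calib)
print(act_scales)
```

With a real model one would presumably gather the same statistics from PyTorch forward hooks on the linear layers and write the resulting dict to a .pt file with `torch.save`. How many calibration items to use and whether act_zeros are needed are not answered in this thread, so check the README linked above for that.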
