How to obtain the min and max values of activation and weights? #28

Closed
CangHaiQingYue opened this issue Mar 5, 2024 · 2 comments

@CangHaiQingYue

I want to deploy a QAT model on a chip, but the chip requires min and max values for the activations and weights. What should I do?
The JSON file has the following format, which is in fact AIMET's format. I found that the mAP from this repository is better than AIMET's.
{
  "activation_encodings": {
    "115": [
      {
        "bitwidth": 8,
        "dtype": "int",
        "is_symmetric": "False",
        "max": 9.063658714294434,
        "min": 0.0,
        "offset": 0,
        "scale": 0.03554375842213631
      }
    ],
  "param_encodings": {
    "conv1.weight": [
      {
        "bitwidth": 8,
        "dtype": "int",
        "is_symmetric": "True",
        "max": 0.08543740957975388,
        "min": -0.08611015230417252,
        "offset": -128,
        "scale": 0.0006727355648763478
      },
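
For reference, the values in the excerpt above appear to satisfy min = scale * offset and max = scale * (offset + 2^bitwidth - 1). The sketch below relies on that relation, which is an assumption inferred from the numbers in this post and not a confirmed API of AIMET or of this repository. The helper names `encoding_range` and `make_encoding` are hypothetical. If the QAT model exposes a scale and an integer offset per tensor, something like this could rebuild the min/max entries:

```python
from typing import Dict, Tuple, Union


def encoding_range(scale: float, offset: int, bitwidth: int = 8) -> Tuple[float, float]:
    """Return the (min, max) real values covered by an affine integer grid.

    Assumed convention (inferred from the JSON in this post):
        min = scale * offset
        max = scale * (offset + 2**bitwidth - 1)
    """
    num_steps = 2 ** bitwidth - 1
    return scale * offset, scale * (offset + num_steps)


def make_encoding(scale: float, offset: int, bitwidth: int = 8,
                  symmetric: bool = False) -> Dict[str, Union[int, float, str]]:
    """Build one encoding entry shaped like the entries shown in the post."""
    lo, hi = encoding_range(scale, offset, bitwidth)
    return {
        "bitwidth": bitwidth,
        "dtype": "int",
        # The post stores this flag as a string, so the sketch does the same.
        "is_symmetric": str(symmetric),
        "max": hi,
        "min": lo,
        "offset": offset,
        "scale": scale,
    }


# Check the assumed relation against the two entries quoted in the post.
act_min, act_max = encoding_range(0.03554375842213631, 0, 8)
w_min, w_max = encoding_range(0.0006727355648763478, -128, 8)
weight_entry = make_encoding(0.0006727355648763478, -128, 8, symmetric=True)
```

The recomputed ranges agree with the posted `min` and `max` up to small rounding differences, which suggests the posted values were stored in single precision. Whether the target chip expects exactly this convention should be checked against its own documentation.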

@lynettez lynettez self-assigned this Mar 25, 2024
@lynettez
Collaborator

Which chip are you using? Is this a general TensorRT question? If so, could you open a new issue at https://github.com/NVIDIA/TensorRT/issues?

@lynettez
Collaborator

lynettez commented Sep 2, 2024

Closing since there has been no activity for several months. Thanks!

@lynettez lynettez closed this as completed Sep 2, 2024