Add weight quantization in post_training_quanzitaion, test=develop #22445
Conversation
LGTM.
    weight, and it should be 8 or 16. Default is 8.
threshold_rate(float, optional): This api uses abs_max methd to
    quantize the weight from float32 to int8/16, and the abs max
    value is important for quantization diff. When the is far
When the is far
?
Done
LGTM.
The op name mentioned in the title should be post_training_quantization.
In order to reduce the model size, this API quantizes the weights of some ops from float32 to int8/16. At the inference stage, the quantized weights are dequantized back to float32.
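As a rough illustration of the abs_max scheme discussed above, here is a minimal numpy sketch. It is not the actual PaddlePaddle implementation: the function names are made up for this example, and the percentile-based handling of `threshold_rate` is an assumption about how outlier clipping might work, not the behavior of this PR.

```python
import numpy as np

def quantize_weight_abs_max(weight, weight_bits=8, threshold_rate=0.0):
    """Quantize a float32 weight array to int8/int16 with the abs_max method.

    Illustrative sketch only. `threshold_rate` is ASSUMED here to clip rare
    outlier values before the scale is computed, so one extreme weight does
    not inflate the quantization error of all the others.
    """
    abs_max = np.abs(weight).max()
    if threshold_rate > 0.0:
        # Assumed behavior: clip to a percentile-based threshold instead of
        # using the raw abs max when outliers are rare.
        threshold = np.percentile(np.abs(weight), (1.0 - threshold_rate) * 100)
        weight = np.clip(weight, -threshold, threshold)
        abs_max = threshold
    quant_range = (1 << (weight_bits - 1)) - 1  # 127 for int8, 32767 for int16
    scale = abs_max / quant_range
    dtype = np.int8 if weight_bits == 8 else np.int16
    quantized = np.round(weight / scale).astype(dtype)
    return quantized, scale

def dequantize_weight(quantized, scale):
    """Recover an approximate float32 weight for the inference stage."""
    return quantized.astype(np.float32) * scale

# Round-trip example: the restored weights differ from the originals
# by at most half a quantization step (scale / 2).
w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize_weight_abs_max(w, weight_bits=8)
w_restored = dequantize_weight(q, s)
print(np.abs(w - w_restored).max())
```

In a real pass, the scale would be stored alongside the int8/16 tensor so the inference engine can dequantize the weights back to float32 on load, which is the behavior the PR description names.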