
Add Embedding quantization #4159

Merged (4 commits, Dec 29, 2022)
Conversation

@LiuChiachi (Contributor) commented Dec 19, 2022

PR types

New features

PR changes

APIs

Description

Add Embedding quantization

PYTHONPATH=/liujiaqi/PaddleNLP:/liujiaqi/PaddleSlim python compress_seq_cls.py    \
    --dataset   "clue cluewsc2020"     \
    --model_name_or_path ernie-3.0-tiny-nano-v2-zh   \
    --per_device_train_batch_size 32   \
    --output_dir ./test  \
    --per_device_eval_batch_size 32    \
    --num_train_epochs 5   \
    --width_mult_list 2/3   \
    --batch_size_list 4   \
    --algo_list 'abs_max'   \
    --strategy 'dynabert ptq embeddings'  \
    --onnx_format False

@paddle-bot (bot) commented Dec 19, 2022

Thanks for your contribution!

@codecov (bot) commented Dec 19, 2022

Codecov Report

Merging #4159 (9cfb4f9) into develop (c73a3a0) will decrease coverage by 0.01%.
The diff coverage is 8.10%.

@@             Coverage Diff             @@
##           develop    #4159      +/-   ##
===========================================
- Coverage    36.39%   36.38%   -0.02%     
===========================================
  Files          419      419              
  Lines        59059    59089      +30     
===========================================
+ Hits         21496    21499       +3     
- Misses       37563    37590      +27     
Impacted Files Coverage Δ
paddlenlp/trainer/trainer_compress.py 8.70% <8.10%> (+0.07%) ⬆️


self.quant(input_dir, args.strategy)
elif args.strategy == "qat":
output_dir_list = self.quant(input_dir, "ptq")
print(output_dir_list)
Collaborator:

The print can be removed.

Contributor Author:

Thanks for the reminder; it has been deleted.

@@ -138,7 +153,7 @@ def _dynabert(self, model, output_dir):
ofa_model = _dynabert_training(
self, ofa_model, model, teacher_model, train_dataloader, eval_dataloader, args.num_train_epochs
)

self.reset_optimizer_and_scheduler()
Collaborator:

What is the reason for resetting the optimizer and learning-rate scheduler here?

Contributor Author:

This is the DynaBERT part. After its training finishes, the optimizer and learning-rate scheduler are reset so that if QAT is attached afterwards, its optimizer and scheduler start fresh and independently rather than continuing from the DynaBERT stage.
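The reasoning above can be illustrated with a small framework-agnostic toy (the class and function names here are hypothetical, not PaddleNLP code): if a learning-rate scheduler is carried over from the DynaBERT stage into QAT, the QAT stage starts from an already-decayed learning rate instead of the intended initial one.

```python
# Toy illustration (hypothetical names, not PaddleNLP code): a step-decay
# scheduler that halves the learning rate every 2 steps.
class StepDecayScheduler:
    def __init__(self, base_lr=0.1, decay_every=2):
        self.base_lr = base_lr
        self.decay_every = decay_every
        self.steps = 0

    def step(self):
        self.steps += 1

    def get_lr(self):
        return self.base_lr * (0.5 ** (self.steps // self.decay_every))


def run_stage(scheduler, num_steps):
    # Simulate one training stage: advance the scheduler num_steps times.
    for _ in range(num_steps):
        scheduler.step()
    return scheduler.get_lr()


# Stage 1 (DynaBERT-like): 4 steps of training decay the LR twice.
sched = StepDecayScheduler()
lr_after_dynabert = run_stage(sched, 4)   # 0.1 * 0.5**2 = 0.025

# Without a reset, the next stage (QAT-like) inherits the decayed LR.
lr_qat_no_reset = sched.get_lr()          # still 0.025

# With a reset (analogous to reset_optimizer_and_scheduler), the next
# stage starts again from the base learning rate.
sched = StepDecayScheduler()
lr_qat_with_reset = sched.get_lr()        # 0.1
```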


input_dir = os.path.dirname(input_prefix)

paddle.fluid.io.save_inference_model(
Collaborator:

It is recommended to stop using fluid-related APIs here.

Contributor Author:

At the moment this is the only API that can be used directly. The newer paddle.static.save_inference_model requires feed_vars (variable objects) as input, whereas the fluid API only needs feed_var_names, and those names are exactly what the load_inference_model call above returns.
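The mismatch described above can be sketched in plain Python (the helper names below are hypothetical stand-ins, not the real Paddle APIs): the older save interface accepts variable *names*, which the load step already returns, while the newer one wants the variable *objects* themselves, so an extra name-to-object lookup step would be needed.

```python
# Toy sketch of the API mismatch (hypothetical stand-ins, not Paddle code).
# "Loading" a model returns the program plus the *names* of its feed vars.
def load_inference_model_like(model):
    return model["program"], model["feed_var_names"], model["fetch_targets"]

# Old-style (fluid-like) save: accepts names, so the load output can be
# passed straight through.
def save_with_names(feed_var_names):
    return {"feeds": feed_var_names}

# New-style (static-like) save: wants variable objects, so each name must
# first be resolved against the program's variable table.
def save_with_vars(program, feed_var_names):
    feed_vars = [program["vars"][name] for name in feed_var_names]
    return {"feeds": feed_vars}

model = {
    "program": {"vars": {"input_ids": object(), "token_type_ids": object()}},
    "feed_var_names": ["input_ids", "token_type_ids"],
    "fetch_targets": ["logits"],
}

program, names, _ = load_inference_model_like(model)
old_style = save_with_names(names)          # direct: names in, names out
new_style = save_with_vars(program, names)  # needs the extra lookup step
```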

@LiuChiachi LiuChiachi requested a review from wawltor December 28, 2022 12:00
@wawltor (Collaborator) left a comment:

LGTM

@wawltor wawltor merged commit 5d542a2 into PaddlePaddle:develop Dec 29, 2022