Compression API supports strategy QAT #3271

LiuChiachi · 2022-09-14T12:51:45Z

PR types

New features

PR changes

APIs

Description

Compression API supports strategy QAT
DONE:

支持QAT
优化了compress()接口，不需要输入custom_dynabert_calc_loss，loss function直接提供在Trainer 初始化时；
更新了文档

Usage:

cd PaddleNLP/model_zoo/ernie-3.0

python compress_seq_cls.py \
    --dataset   "clue cluewsc2020"   \
    --model_name_or_path ernie-3.0-nano-zh \
    --per_device_train_batch_size 32 \
    --output_dir ./test \
    --per_device_eval_batch_size 32 \
    --num_train_epochs 5 \
    --width_mult_list 2/3 \
    --strategy 'qat' \
    --batch_size_list 4 \
    --algo_list 'abs_max' \



python compress_token_cls.py      --dataset   "msra_ner"   \
    --model_name_or_path best_models/MSRA_NER/   \
    --output_dir ./  --remove_unused_columns False   \
    --max_seq_length 32    \
    --per_device_train_batch_size 32   \
    --per_device_eval_batch_size 32  \
    --learning_rate 0.00005    \
    --remove_unused_columns False  \
    --num_train_epochs 1 \
    --batch_size_list 4 \
    --algo_list 'abs_max' \
    --strategy 'qat'


python compress_qa.py \
    --dataset "clue cmrc2018" \
    --width_mult_list 2/3 \
    --model_name_or_path best_models/CMRC2018  \
    --output_dir ./ \
    --max_seq_length 32 \
    --learning_rate 0.00003 \
    --num_train_epochs 1 \
    --per_device_train_batch_size 24 \
    --per_device_eval_batch_size 24 \
    --max_answer_length 50 \
    --strategy 'qat' \
    --batch_size_list 4 \
    --algo_list 'abs_max' \

…nto support-qat

wawltor · 2022-10-17T03:46:03Z

paddlenlp/transformers/ofa_utils.py

+                batch.pop("length")
+            if "seq_len" in batch:
+                batch.pop("seq_len")
+        elif "start_positions" in batch and "end_positions" in batch:


这种分支代码太多，是不是可以通过配置List来解决

感谢提醒，已经修改

wawltor · 2022-10-17T03:48:47Z

paddlenlp/trainer/trainer_compress.py

-                                            dtype="int64")  # input_ids
-                ]
-
+            input_spec = generate_input_spec(self.model, self.train_dataset)


看了一下函数中的写法，不考 start_positions 和 end_positions 这个应该是针对UIE写法，不过 start_positions 和 end_positions 这两个字段也不会出现在UIE的模型中

start_positions和end_positions这个在qa任务和UIE里都有，在原文中抽取的任务会有的。这个函数是想通过forward的参数、dataloader的数据来判断input_spec的个数，需要排除掉labels/start_positions和end_positions

…nto support-qat

wawltor

LGTM

This reverts commit 217a25c.

LiuChiachi self-assigned this Sep 14, 2022

LiuChiachi added the model-compression label Sep 14, 2022

LiuChiachi added 2 commits September 14, 2022 13:33

supports strategy 'qat'

eaf3cd1

solve conflicts

bc6db98

LiuChiachi force-pushed the support-qat branch 3 times, most recently from d8c9cbb to 6af10e8 Compare September 29, 2022 12:49

LiuChiachi marked this pull request as ready for review September 29, 2022 12:49

Update UIE QAT

4a2f42f

LiuChiachi requested a review from wawltor October 14, 2022 04:10

Update UIE QAT

96b116e

LiuChiachi force-pushed the support-qat branch from 6af10e8 to 96b116e Compare October 14, 2022 05:44

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleNLP i…

18fac4a

…nto support-qat

LiuChiachi force-pushed the support-qat branch from 8d89e0d to 18fac4a Compare October 17, 2022 03:33

wawltor reviewed Oct 17, 2022

View reviewed changes

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleNLP i…

fb219bc

…nto support-qat

LiuChiachi force-pushed the support-qat branch from 2beaa20 to fb219bc Compare October 17, 2022 05:59

wawltor approved these changes Oct 17, 2022

View reviewed changes

Merge branch 'develop' into support-qat

a178efd

LiuChiachi merged commit 217a25c into PaddlePaddle:develop Oct 17, 2022

joey12300 added a commit to joey12300/PaddleNLP that referenced this pull request Oct 18, 2022

Revert "supports strategy 'qat' (PaddlePaddle#3271)"

a86acee

This reverts commit 217a25c.

LiuChiachi mentioned this pull request Jan 12, 2023

PaddleNLP 2.5.0 Release Note Candidate #4439

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compression API supports strategy QAT #3271

Compression API supports strategy QAT #3271

LiuChiachi commented Sep 14, 2022 •

edited

Loading

wawltor Oct 17, 2022

LiuChiachi Oct 17, 2022

wawltor Oct 17, 2022

LiuChiachi Oct 17, 2022

wawltor left a comment

Compression API supports strategy QAT #3271

Compression API supports strategy QAT #3271

Conversation

LiuChiachi commented Sep 14, 2022 • edited Loading

PR types

PR changes

Description

wawltor Oct 17, 2022

Choose a reason for hiding this comment

LiuChiachi Oct 17, 2022

Choose a reason for hiding this comment

wawltor Oct 17, 2022

Choose a reason for hiding this comment

LiuChiachi Oct 17, 2022

Choose a reason for hiding this comment

wawltor left a comment

Choose a reason for hiding this comment

LiuChiachi commented Sep 14, 2022 •

edited

Loading