[LLM] Add pipeline and flashmask for Qwen2Moe and Deepseek #9827

DrownFish19 · 2025-02-01T11:11:27Z

Before submitting

Lint code. If there are lint issues, please format the code first.

# Install and register `pre-commit` in the project folder
pip install pre-commit && pre-commit install

# Process previous code files separately
pre-commit run --file XXXX.py

Add test cases into tests folder. If there are codecov issues, please add tests cases first.

PR types

New features

PR changes

Models

Description

Add pipeline and flashmask for Qwen2Moe and Deepseek.

…d_pipeline_for_moe

codecov · 2025-02-01T11:49:56Z

Codecov Report

Attention: Patch coverage is 33.39318% with 371 lines in your changes missing coverage. Please review.

Project coverage is 52.16%. Comparing base (bad2240) to head (3a320cb).
Report is 2 commits behind head on develop.

❗ Current head 3a320cb differs from pull request most recent head 47628e4

Please upload reports for the commit 47628e4 to get more accurate results.

Files with missing lines	Patch %	Lines
paddlenlp/transformers/deepseek_v2/modeling_pp.py	23.60%	123 Missing ⚠️
paddlenlp/transformers/qwen2_moe/modeling_pp.py	23.12%	123 Missing ⚠️
paddlenlp/transformers/qwen2_moe/modeling.py	58.06%	52 Missing ⚠️
paddlenlp/transformers/deepseek_v2/modeling.py	17.54%	47 Missing ⚠️
paddlenlp/transformers/moe_gate.py	38.70%	19 Missing ⚠️
paddlenlp/trl/llm_utils.py	0.00%	3 Missing ⚠️
paddlenlp/transformers/deepseek_v3/modeling.py	0.00%	2 Missing ⚠️
paddlenlp/transformers/llama/fusion_ops.py	0.00%	2 Missing ⚠️

❌ Your patch check has failed because the patch coverage (33.39%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.
❌ Your project check has failed because the head coverage (52.16%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9827      +/-   ##
===========================================
+ Coverage    51.45%   52.16%   +0.70%     
===========================================
  Files          737      733       -4     
  Lines       119471   116187    -3284     
===========================================
- Hits         61472    60604     -868     
+ Misses       57999    55583    -2416

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

paddle-bot · 2025-02-05T04:48:41Z

Thanks for your contribution!

…r_moe

…d_pipeline_for_moe

QingshuChen · 2025-02-08T08:02:03Z

paddlenlp/transformers/moe_gate.py

+        assert self.e_score_correction_bias is not None, "e_score_correction_bias is None"
+        scores = scores.reshape([bsz_seq_len, -1]) + self.e_score_correction_bias.unsqueeze(0)
+        group_scores = scores.reshape([bsz_seq_len, self.n_group, -1]).topk(2, axis=-1)[0].sum(axis=-1)  # [n, n_group]
+        group_idx = paddle.topk(group_scores, k=topk_group, axis=-1, sorted=False)[1]  # [n, top_k_group]


这边sorted=False对gpu不生效, 应该怎么修改呀?

DrownFish19 added 5 commits January 26, 2025 15:35

add modleing_pp

741b8e7

add modleing_pp for qwen2moe

cf82bcc

add flashmask and pp for Qwen2MoE and Deepseek

d646dba

remove

3fcf2c1

Merge remote-tracking branch 'paddlenlp/develop' into dev_20250126_ad…

3a320cb

…d_pipeline_for_moe

fix fast_tokenizer save

d55f559

DrownFish19 added 5 commits February 6, 2025 10:19

update for topk_weight of noaux_tc

b104eaa

Merge branch 'PaddlePaddle:develop' into dev_20250126_add_pipeline_fo…

4c7f5d6

…r_moe

fix for flashmask

ecad2f1

Merge remote-tracking branch 'paddlenlp/develop' into dev_20250126_ad…

446b4da

…d_pipeline_for_moe

add use_expert_parallel for pretrain

80f5c98

QingshuChen reviewed Feb 8, 2025

View reviewed changes

fix tokenizer test

47628e4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLM] Add pipeline and flashmask for Qwen2Moe and Deepseek #9827

[LLM] Add pipeline and flashmask for Qwen2Moe and Deepseek #9827

DrownFish19 commented Feb 1, 2025

codecov bot commented Feb 1, 2025 •

edited

Loading

paddle-bot bot commented Feb 5, 2025

QingshuChen Feb 8, 2025

[LLM] Add pipeline and flashmask for Qwen2Moe and Deepseek #9827

Are you sure you want to change the base?

[LLM] Add pipeline and flashmask for Qwen2Moe and Deepseek #9827

Conversation

DrownFish19 commented Feb 1, 2025

Before submitting

PR types

PR changes

Description

codecov bot commented Feb 1, 2025 • edited Loading

Codecov Report

paddle-bot bot commented Feb 5, 2025

QingshuChen Feb 8, 2025

Choose a reason for hiding this comment

codecov bot commented Feb 1, 2025 •

edited

Loading