[LLM] Add expert parallel #9368

DrownFish19 · 2024-11-05T03:36:19Z

PR types

New features

PR changes

APIs

Description

Add expert parallel

added in Qwen2Moe

1. 精度验证代码：

import paddle
from paddlenlp.transformers.qwen2_moe.modeling import Qwen2MoeSparseMoEBlock, Qwen2MoeSparseMoEBlock_OLD

from paddlenlp.transformers import Qwen2MoeConfig

config = Qwen2MoeConfig.from_pretrained("Qwen/Qwen2-57B-A14B")
paddle.set_default_dtype(paddle.float32)

with paddle.amp.auto_cast(True):
    block = Qwen2MoeSparseMoEBlock(config)
    block_old = Qwen2MoeSparseMoEBlock_OLD(config)

state_dict = block.state_dict()
block_old.set_state_dict(state_dict)

for seq_len in [i * 1024 for i in range(1, 128)]:
    hidden_state = paddle.rand([1, 1024, config.hidden_size], dtype=paddle.float32).cast(paddle.get_default_dtype())

    block_output = block(hidden_state)
    block_output_old = block_old(hidden_state)

    print(seq_len, ": ", float(paddle.max(paddle.abs(block_output[0] - block_output_old[0]))))

2. 精度验证结果：

同Qwen2Moe原始Moe计算代码（没有专家并行）比较，序列长度计算1k-128k，最大diff保持不变。

float32 diff: 0.0
float16 diff: 1e-4
bfloat16 diff: 9e-4

…d_expert_parallel

…llel

…d_expert_parallel

…llel

…d_expert_parallel

paddle-bot · 2024-11-05T03:36:23Z

Thanks for your contribution!

codecov · 2024-11-05T04:07:28Z

Codecov Report

Attention: Patch coverage is 50.27473% with 181 lines in your changes missing coverage. Please review.

Project coverage is 52.89%. Comparing base (b5e3f0c) to head (358483b).
Report is 21 commits behind head on develop.

Files with missing lines	Patch %	Lines
paddlenlp/transformers/moe_gate.py	42.29%	131 Missing ⚠️
paddlenlp/transformers/moe_layer.py	54.62%	49 Missing ⚠️
paddlenlp/transformers/qwen2_moe/modeling.py	95.83%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9368      +/-   ##
===========================================
- Coverage    53.01%   52.89%   -0.12%     
===========================================
  Files          678      678              
  Lines       108787   108249     -538     
===========================================
- Hits         57668    57262     -406     
+ Misses       51119    50987     -132

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚨 Try these New Features:

Flaky Tests Detection - Detect and resolve failed and flaky tests

…d_expert_parallel

wawltor

LGTM

DrownFish19 and others added 20 commits October 18, 2024 03:07

add expert parallel utils

9414d67

update gates

a6260cf

update

39d1660

Merge remote-tracking branch 'paddlenlp/develop' into dev_20241018_ad…

86ee3cc

…d_expert_parallel

update base methods

cc41578

Merge remote-tracking branch 'paddlenlp/develop' into dev_20241018_ad…

0fcba13

…d_expert_parallel

update moe_layer

2b74f30

Merge remote-tracking branch 'paddlenlp/develop' into dev_20241018_ad…

2a24fda

…d_expert_parallel

update moebase

f517473

add moe_gate and moe_layer for qwen2moe

1a3399e

add config

d6a16eb

Merge branch 'PaddlePaddle:develop' into dev_20241018_add_expert_para…

440b848

…llel

update

fad1a4f

Merge remote-tracking branch 'paddlenlp/develop' into dev_20241018_ad…

e0f3e93

…d_expert_parallel

update gate dtype

8701b52

Merge branch 'PaddlePaddle:develop' into dev_20241018_add_expert_para…

4af8a68

…llel

Merge branch 'PaddlePaddle:develop' into dev_20241018_add_expert_para…

4ef7d4f

…llel

update moe gate and layer

448ecbd

Merge remote-tracking branch 'paddlenlp/develop' into dev_20241018_ad…

0a3af3b

…d_expert_parallel

update moe_layer.py

77ec9b0

DrownFish19 added 8 commits November 6, 2024 02:12

update

ff93012

update

de2d257

update token_priority method

83bdedd

update data type

63f6755

remove old moe

17537b3

Merge remote-tracking branch 'paddlenlp/develop' into dev_20241018_ad…

88a91a1

…d_expert_parallel

fix moe capacity reduce.Max

2b0bf16

update comment

801f0ff

lint

358483b

wawltor approved these changes Nov 20, 2024

View reviewed changes

wawltor merged commit 590081a into PaddlePaddle:develop Nov 20, 2024
9 of 12 checks passed

DrownFish19 deleted the dev_20241018_add_expert_parallel branch November 20, 2024 07:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLM] Add expert parallel #9368

[LLM] Add expert parallel #9368

DrownFish19 commented Nov 5, 2024 •

edited

Loading

paddle-bot bot commented Nov 5, 2024

codecov bot commented Nov 5, 2024 •

edited

Loading

wawltor left a comment

[LLM] Add expert parallel #9368

[LLM] Add expert parallel #9368

Conversation

DrownFish19 commented Nov 5, 2024 • edited Loading

PR types

PR changes

Description

1. 精度验证代码：

2. 精度验证结果：

paddle-bot bot commented Nov 5, 2024

codecov bot commented Nov 5, 2024 • edited Loading

Codecov Report

wawltor left a comment

Choose a reason for hiding this comment

DrownFish19 commented Nov 5, 2024 •

edited

Loading

codecov bot commented Nov 5, 2024 •

edited

Loading