
Add sequence parallel strategy support. #734

Merged
merged 2 commits into PaddlePaddle:develop on Sep 16, 2022

Conversation

@GhostScreaming (Contributor) commented on Sep 16, 2022

  1. Add the sequence parallel strategy for GPTModelHybrid (a conceptual
     sketch of the communication pattern is included below).
  2. Output has been checked layer by layer in both the forward and
     backward passes, and the loss curve over the first 5000 steps
     matches the baseline (pretrain_gpt_1.3B_mp8).
  3. Performance improves by about 10% with the sequence_parallel
     strategy compared with pretrain_gpt_1.3B_mp8.

[Image: loss curves over the first 5000 training steps, sequence-parallel run vs. pretrain_gpt_1.3B_mp8 baseline]

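For readers unfamiliar with the technique, the sketch below illustrates the data movement that sequence parallelism adds on top of tensor model parallelism: LayerNorm/dropout activations stay split along the sequence dimension on each model-parallel rank, an all-gather restores the full sequence before the column-parallel matmul, and a reduce-scatter (here simulated as a sum followed by a split) replaces the all-reduce after the row-parallel matmul. This is a single-process NumPy illustration under assumed names (`mp_degree`, `seq_chunks`, a toy MLP); it is not the GPTModelHybrid code added in this PR.

```python
# Conceptual, single-process simulation of sequence parallelism (NumPy).
# mp_degree, seq_chunks, w1/w2 shard names are hypothetical illustration
# values, not identifiers from this PR.
import numpy as np

mp_degree = 4                       # tensor/model parallel degree
batch, seq_len, hidden = 2, 8, 16
ffn_hidden = 4 * hidden

rng = np.random.default_rng(0)
x = rng.standard_normal((batch, seq_len, hidden))
w1 = rng.standard_normal((hidden, ffn_hidden))    # column-parallel weight
w2 = rng.standard_normal((ffn_hidden, hidden))    # row-parallel weight

# With sequence parallelism each rank stores only seq_len / mp_degree tokens
# of the LayerNorm/dropout activations instead of the full sequence.
seq_chunks = np.split(x, mp_degree, axis=1)       # one chunk per "rank"
w1_shards = np.split(w1, mp_degree, axis=1)       # column-parallel split
w2_shards = np.split(w2, mp_degree, axis=0)       # row-parallel split

outputs = []
for rank in range(mp_degree):
    # all-gather along the sequence dim: rebuild the full sequence before
    # the column-parallel matmul (simulated by concatenating all chunks).
    gathered = np.concatenate(seq_chunks, axis=1)          # [b, seq, hidden]
    partial = np.maximum(gathered @ w1_shards[rank], 0.0)  # column-parallel + ReLU
    partial = partial @ w2_shards[rank]                    # row-parallel partial sum
    outputs.append(partial)

# reduce-scatter: sum the partial results across ranks, then split the sum
# back along the sequence dimension so each rank keeps only its own tokens.
summed = np.sum(outputs, axis=0)
out_chunks = np.split(summed, mp_degree, axis=1)

# Reference: the same MLP computed without any parallelism.
ref = np.maximum(x @ w1, 0.0) @ w2
assert np.allclose(np.concatenate(out_chunks, axis=1), ref, atol=1e-6)
print("sequence-parallel simulation matches the serial reference")
```

The point of the exchange pattern is that the all-reduce used by plain tensor parallelism is split into an all-gather/reduce-scatter pair with the same total communication volume, while the replicated LayerNorm and dropout activations shrink by a factor of mp_degree, which is where the memory and throughput gains reported above come from.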
@ForFishes (Member) left a comment:
LGTM

@ForFishes merged commit 85870f8 into PaddlePaddle:develop on Sep 16, 2022