bugfix: parallel_executor for xpu should use BindThreadedSSAGraphExec… #32792

Merged

houj04 (Contributor) commented May 7, 2021

PR types

Bug fixes

PR changes

Others

Describe

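In the following commit
faa8c70
some XPU-related code was lost.

Specifically:
faa8c70#diff-9507db329e07d063070d3e632b4efa0ef262e8730af0038fdfbb6d1653bcd45cL952
A few lines of XPU-related code starting at this point were dropped. As a result, model training slows down significantly on a K200 in the single-card, 2-PD configuration.
For example, when training ResNet50 with PaddleClas, throughput on a single PD drops from 65-70 images/s to about 39 images/s.

This PR only restores the code that was lost in the commit mentioned above.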


paddle-bot-old bot commented May 7, 2021

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Aurelius84 (Contributor) left a comment

LGTM

Aurelius84 merged commit e8e4a9c into PaddlePaddle:develop May 8, 2021
houj04 deleted the fix-parallel-executor-for-xpu branch May 8, 2021 05:54
houj04 added a commit to houj04/Paddle that referenced this pull request May 17, 2021
houj04 restored the fix-parallel-executor-for-xpu branch September 7, 2021 03:31