bugfix: parallel_executor for xpu should use BindThreadedSSAGraphExec… #32792
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
PR types
Bug fixes
PR changes
Others
Describe
在下面的这个commit中
faa8c70
有一部分关于xpu的代码丢失了。
具体是:
faa8c70#diff-9507db329e07d063070d3e632b4efa0ef262e8730af0038fdfbb6d1653bcd45cL952
从这里开始的少数几行xpu相关的代码丢掉了。导致的结果是,在K200的单卡2PD的时候,模型训练速度会显著下降。
例如,在PaddleClas上面训练ResNet50的时候,单PD上的速度会从65-70images/s下降到约39images/s。
这个PR只是把前面提到的commit里面丢失的代码恢复回来而已。