-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
【PaddlePaddle Hackathon 5 No.48】Fix bug that thread configuration parameters are out of bounds #58307
【PaddlePaddle Hackathon 5 No.48】Fix bug that thread configuration parameters are out of bounds #58307
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
@luotao1 涛姐这个PR改动超千行了,麻烦通过一下~~ |
@wanghuancoder 您提到的 |
Sorry to inform you that dc039bd's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
…ameters are out of bounds (PaddlePaddle#58307) * modified: paddle/phi/kernels/gpu/strided_copy_kernel.cu * modified: VerifyThreadConfigurationParameters * fix bugs
PR types
Bug fixes
PR changes
OPs
Description
StridedCopyKernel在某些情况下,block.x、block.y、block.z、grid.y、grid.z的数值,会变的极端:过大,会报错;过小,性能会受到影响。
硬件限制
block.x <= 1024
block.y <= 1024
block.z <= 64
block.x * block.y * block.z >= 96(小于96时性能大幅降低)
block.x * block.y * block.z <= 1024
grid.x 无限制
grid.y < 2^16
grid.z < 2^16
关联PR
【PaddlePaddle Hackathon 5 No.48】ContiguousKernel、StridedCopyKernel算子CPU、GPU性能优化 -part #57835