[Targeting Q4] test_triangular_solve_op fails on "Intel(R) Xeon(R) Silver 4314 CPU" #55707
Comments
Hello, we have received your issue and are looking into it.
@Tom-Zheng Hello, are you running the CPU or the GPU build of Paddle? This op unit test passes in our internal runs: https://xly.bce.baidu.com/paddlepaddle/paddle/newipipe/detail/8988447/job/23602891
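Please see the description: the problem only reproduces on "Intel(R) Xeon(R) Silver 4314 CPU".
We are running the GPU build of Paddle, but this UT fails in the CPU kernel, so the CPU build should reproduce it as well.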
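@Tom-Zheng The test passes on all of the CPU models we run internally. On CPU, triangular_solve is computed with the mklml library provided by Intel, so that library may have a computation problem on this particular CPU. You could test the openblas build of Paddle to see whether it has the same problem:
Please also confirm whether the GPU version has the same problem. If both the openblas build and the GPU version pass, we can be fairly sure the Intel mklml library is the cause.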
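The backend comparison suggested above can be made mechanical with a small reference check. The following is a minimal sketch, not Paddle's own test: `reference_upper_solve`, `make_upper_system`, `max_residual`, and `backend_agrees` are hypothetical helper names, and `solve_fn` stands for whatever backend is under test (mklml build, openblas build, or GPU), wrapped so that it takes and returns plain Python lists.

```python
import random


def reference_upper_solve(a, b):
    """Solve a @ x = b for an upper-triangular matrix a by back substitution."""
    n = len(a)
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(a[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (b[i] - s) / a[i][i]
    return x


def max_residual(a, x, b):
    """Largest absolute entry of a @ x - b."""
    n = len(a)
    return max(abs(sum(a[i][j] * x[j] for j in range(n)) - b[i]) for i in range(n))


def make_upper_system(n, seed=0):
    """Build a well-conditioned upper-triangular system (hypothetical test data)."""
    rng = random.Random(seed)
    a = [[rng.uniform(-1, 1) if j > i else 0.0 for j in range(n)] for i in range(n)]
    for i in range(n):
        a[i][i] = n + rng.uniform(0, 1)  # dominant diagonal keeps the solve stable
    b = [rng.uniform(-1, 1) for _ in range(n)]
    return a, b


def backend_agrees(solve_fn, n=8, seed=0, tol=1e-9):
    """Return True if solve_fn matches the reference and has a small residual."""
    a, b = make_upper_system(n, seed)
    x_ref = reference_upper_solve(a, b)
    x = solve_fn(a, b)
    err = max(abs(u - v) for u, v in zip(x, x_ref))
    return err <= tol and max_residual(a, x, b) <= tol
```

To apply this to Paddle, `solve_fn` could wrap a call to `paddle.linalg.triangular_solve` that converts the lists to tensors and back. That wrapper is left out here, and float32 inputs would likely need a looser `tol` than the default assumed above. Running the same check on each build on the affected machine shows which backend disagrees with the reference.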
Will come back to this issue in Q4.
The problem is gone after updating CBLAS from v0.3.18 to v0.3.24.
Describe the Bug
The CPU kernel of triangular_solve breaks on Intel(R) Xeon(R) Silver 4314 CPU. This causes test_triangular_solve_op to fail. test_lu_op and test_qr_op are also affected because they rely on triangular_solve.
Paddle version: release/2.5
Error info:
CPU info:
Additional Supplementary Information
No response