-
-
Notifications
You must be signed in to change notification settings - Fork 202
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No XgemmHalf in database on development branch? #96
Comments
Yes, there is. I have only tested half-precision (fp16) on Intel GPUs with Beignet OpenCL. So far, I couldn't make the kernel work properly with Beignet under Linux. Thus, I haven't been able to run the tuners. As a result there are no entries for fp16 GEMM in the database. I don't have access to other hardware supporting fp16, so I can't say for 100% that it is a Beignet-related issue - it could of course also be a bug in CLBlast. But until I have other fp16 hardware available (or someone else runs the tuners), it isn't fully tested. |
On Mali-T628 (OpenCL r10), the kernel runs, but there are test failures (not all: 85 passed / 278 skipped / 149 failed). Is this what you mean by "not working properly"? |
And if I swap the default kernel with the one written by ARM, then the numbers are 87/278/147. Couldn't it be an issue with precision? |
With Beignet I haven't managed to get the tuner working for HGEMM. I haven't tuned for Mali FP16, that's why the parameters are not included in the database. Feel free to do so and upload the results to #1. After that's done the next step would be to look at correctness. |
BTW, how does it support half-precision? Here (HD4600, beignet from git master) CLBlast says -2045. That is, I don't have |
The following 2 devices at least support
Could very well be that your hardware doesn't support FP16. |
Ah, that matches the situation with proprietary drivers. Looks like I'm out of luck to verify this. |
I changed the database script such that it now generates a default parameter set based on 32-bit precision (e.g SGEMM) in case there is no entry yet (e.g. HGEMM). This fixes your issue. |
Is there any specific reason why there are no entries (not even the default) for half-precision GEMM on the
development
branch?The text was updated successfully, but these errors were encountered: