[SYCL] Fix DMMV dequantization #9279

OuadiElfarouki · 2024-09-02T16:24:28Z

The MUL_MAT tests in test-backend-ops currently fail on Intel GPUs for Q4_1, Q5_0, Q5_1 and Q8_0 due to a small edge-case issue in the dequantize_mul_mat_vec kernel (specifically when ncols <= GGML_SYCL_DMMV_X).

This is a minor fix that prevents the kernel from accessing out-of-bounds/extra quant elements in its reduction step.
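To illustrate the edge case, here is a minimal CPU-side sketch, not the actual patch. It assumes a simplified Q8_0-like block (`BlockQ8`, one scale plus 32 int8 quants) and uses hypothetical stand-ins `WARP` and `DMMV_X` for the real work-group width and `GGML_SYCL_DMMV_X`. Each simulated lane starts at its own column offset. When ncols is smaller than the per-iteration stride, the upper lanes would start past the end of the row unless they are guarded.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-ins for illustration only; not the ggml definitions.
constexpr int QK     = 32;  // quants per block (Q8_0-like)
constexpr int WARP   = 32;  // simulated lanes cooperating on one row
constexpr int DMMV_X = 32;  // plays the role of GGML_SYCL_DMMV_X

struct BlockQ8 {
    float  d;        // per-block scale
    int8_t qs[QK];   // quantized values
};

// Dot product of one quantized row with y, computed the way a DMMV-style
// kernel would: every lane accumulates a partial sum, then the partial
// sums are reduced.
float dmmv_row(const std::vector<BlockQ8>& row, const std::vector<float>& y, int ncols) {
    assert(ncols % QK == 0);

    const int iter_stride   = 2 * DMMV_X;          // columns covered per iteration by all lanes
    const int vals_per_iter = iter_stride / WARP;  // columns handled by one lane per iteration

    std::vector<float> partial(WARP, 0.0f);
    for (int tid = 0; tid < WARP; ++tid) {
        for (int i = 0; i < ncols; i += iter_stride) {
            const int col = i + vals_per_iter * tid;
            // Guard: with ncols < iter_stride, lanes whose start column is
            // past the row end must not read extra quant elements.
            if (col >= ncols) {
                break;
            }
            for (int j = 0; j < vals_per_iter; ++j) {
                const int c = col + j;
                const BlockQ8& b = row[c / QK];
                partial[tid] += b.d * static_cast<float>(b.qs[c % QK]) * y[c];  // dequantize and multiply
            }
        }
    }

    // Reduction step: combine the per-lane partial sums.
    float sum = 0.0f;
    for (float p : partial) {
        sum += p;
    }
    return sum;
}
```

With `ncols == 32` and a stride of 64 in this sketch, lanes 16 to 31 hit the guard immediately. Without it they would index past the single block in `row`.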

All unit tests pass with this fix.
Performance on Intel GPUs is almost unaffected.

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

joeatodd · 2024-09-03T12:58:31Z

I think given the perf implications of this bounds checking, we should dig a little deeper.

OuadiElfarouki · 2024-09-03T13:01:04Z

@joeatodd Agree

OuadiElfarouki · 2024-09-04T09:36:29Z

Updated the fix; performance is now preserved.

joeatodd

LGTM

ggml/src/ggml-sycl/dmmv.cpp

Fixed dmmv dequant for ncols== GGML_SYCL_DMMV_X

OuadiElfarouki added 2 commits September 2, 2024 14:06

Fixed dmmv dequant for k<= GGML_SYCL_DMMV_X

3ba9d04

removed unecessary condition

da18950

OuadiElfarouki requested review from joeatodd and airMeng September 2, 2024 16:24

github-actions bot added the SYCL label Sep 2, 2024

rectified dmmv quant fix

8cdbe11

airMeng approved these changes Sep 4, 2024


joeatodd approved these changes Sep 4, 2024


ggml/src/ggml-sycl/dmmv.cpp (review thread outdated, resolved)

Removed unecessary loop unrolling

24bfbde

OuadiElfarouki merged commit 5910ea9 into ggerganov:master Sep 4, 2024
52 checks passed

dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024

[SYCL] Fix DMMV dequantization (ggerganov#9279)

3487497

Fixed dmmv dequant for ncols== GGML_SYCL_DMMV_X

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024

[SYCL] Fix DMMV dequantization (ggerganov#9279)

c6e616b

Fixed dmmv dequant for ncols== GGML_SYCL_DMMV_X
