kompute: improve backend to pass test_backend_ops #10542

slp · 2024-11-27T11:11:04Z

This is a first batch of improvements on the kompute backend to be able to pass test_backend_ops by fixing some bugs and adding some missing features. Tested on Apple Silicon (M1 GPU) and AMD (Vega 8).

The next batch will extend test_backend_ops coverage by adding support for more operations.

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

Signed-off-by: Sergio Lopez <slp@redhat.com>

slaren · 2024-11-27T11:20:06Z

I tried running test-backend-ops under Windows with NVIDIA GPUs. The first GPU passed, but the second one hangs in the first op:

Testing 3 devices

Backend 1/3: Kompute0
  Device description: NVIDIA GeForce RTX 3080
  Device memory: 10053 MB (10053 MB free)
  [...]
  1919/1919 tests passed
  Backend Kompute0: OK

Backend 2/3: Kompute1
  Device description: NVIDIA GeForce RTX 3090 Ti
  Device memory: 24313 MB (24313 MB free)

  ABS(type=f32,ne_a=[128,2,2,2],v=0): not supported [Kompute1]
  ABS(type=f32,ne_a=[5,7,11,13],v=0): not supported [Kompute1]
  SGN(type=f32,ne_a=[128,2,2,2],v=0): not supported [Kompute1]
  SGN(type=f32,ne_a=[5,7,11,13],v=0): not supported [Kompute1]
  NEG(type=f32,ne_a=[128,2,2,2],v=0): not supported [Kompute1]
  NEG(type=f32,ne_a=[5,7,11,13],v=0): not supported [Kompute1]
  STEP(type=f32,ne_a=[128,2,2,2],v=0): not supported [Kompute1]
  STEP(type=f32,ne_a=[5,7,11,13],v=0): not supported [Kompute1]
  TANH(type=f32,ne_a=[128,2,2,2],v=0): not supported [Kompute1]
  TANH(type=f32,ne_a=[5,7,11,13],v=0): not supported [Kompute1]
  ELU(type=f32,ne_a=[128,2,2,2],v=0): not supported [Kompute1]
  ELU(type=f32,ne_a=[5,7,11,13],v=0): not supported [Kompute1]
  RELU(type=f32,ne_a=[128,2,2,2],v=0):

slp · 2024-11-27T11:31:35Z

I tried running test-backend-ops under Windows with NVIDIA GPUs. The first GPU passed, but the second one hangs in the first op:

Thanks for letting me know. I guess the most likely explanation is that the kompute backend has trouble initializing devices other than the first one. I'll try to find some hardware to fix it in a future PR.

slp added 7 commits November 26, 2024 03:52

kompute: op_unary: reject unsupported parameters

913536f

Signed-off-by: Sergio Lopez <slp@redhat.com>

kompute: softmax: implement ALiBi support

d888959

Signed-off-by: Sergio Lopez <slp@redhat.com>

kompute: rope: implement neox and phi3 support

1b8afa8

Signed-off-by: Sergio Lopez <slp@redhat.com>

kompute: op_mul_mat_q4_k permutted support

2ac1d0e

Signed-off-by: Sergio Lopez <slp@redhat.com>

kompute: op_mul_mat_[q4_0|q4_1|q8_0] permutted support

9c5bdf4

Signed-off-by: Sergio Lopez <slp@redhat.com>

kompute: op_mul_mat_f16 permutted support

f54c96e

Signed-off-by: Sergio Lopez <slp@redhat.com>

kompute: op_mul_mat_q6_k permutted support

0e3d85d

Signed-off-by: Sergio Lopez <slp@redhat.com>

github-actions bot added ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ labels Nov 27, 2024

slaren approved these changes Nov 27, 2024

View reviewed changes

slp changed the title ~~kompute: improve backend for pass test_backend_ops~~ kompute: improve backend to pass test_backend_ops Nov 28, 2024

slp merged commit 2025fa6 into ggerganov:master Nov 28, 2024
50 checks passed

slp deleted the kompute-fix-tests branch November 28, 2024 11:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kompute: improve backend to pass test_backend_ops #10542

kompute: improve backend to pass test_backend_ops #10542

slp commented Nov 27, 2024 •

edited

Loading

slaren commented Nov 27, 2024

slp commented Nov 27, 2024

kompute: improve backend to pass test_backend_ops #10542

kompute: improve backend to pass test_backend_ops #10542

Conversation

slp commented Nov 27, 2024 • edited Loading

slaren commented Nov 27, 2024

slp commented Nov 27, 2024

slp commented Nov 27, 2024 •

edited

Loading