
Add ROCM aliases for CUDA pool stuff #3918

Merged
Merged 1 commit into ggerganov:master from fix-rocm-build on Nov 2, 2023

Conversation

KerfuffleV2 (Collaborator) commented Nov 2, 2023

Hey, guess what? ROCM is broken again! The good news is it seems like an easy fix. (BTW, sorry GG, I didn't see your other request for review in time.)
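For context, the fix boils down to a handful of preprocessor aliases in ggml-cuda.cu that map the CUDA memory-pool API onto its HIP equivalents. An abridged sketch of the approach (the exact alias list is in commit 629f917 and may differ from what's shown here):

```cpp
// Abridged sketch of the alias approach in ggml-cuda.cu; the exact list
// lives in commit 629f917. When building for HIP/ROCm, the CUDA
// memory-pool symbols are redirected to their HIP counterparts.
#if defined(GGML_USE_HIPBLAS)
#define cudaMemPool_t                   hipMemPool_t
#define cudaDeviceGetDefaultMemPool     hipDeviceGetDefaultMemPool
#define cudaMemPoolAttrReleaseThreshold hipMemPoolAttrReleaseThreshold
#define cudaMemPoolSetAttribute         hipMemPoolSetAttribute
#define cudaMallocAsync                 hipMallocAsync
#define cudaFreeAsync                   hipFreeAsync
#endif
```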

@KerfuffleV2 added the bug (Something isn't working), build (Compilation issues), and AMD GPU (Issues specific to AMD GPUs) labels on Nov 2, 2023
@ggerganov merged commit 629f917 into ggerganov:master on Nov 2, 2023
27 of 31 checks passed
KerfuffleV2 (Collaborator, Author) commented Nov 2, 2023

Though it compiles with this, something still seems wrong. I'm not sure if it's ROCM specific. Offloading the last non-repeating layer (KV) produces all NaNs. I.e., the Orca 3B model with 26 actual layers is okay offloading up to 28, but 29 doesn't work. Mistral with 32 real layers is okay at 34, but 35 doesn't work.
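A hypothetical sketch of the arithmetic (illustrative names and thresholds, not llama.cpp's actual offload code): each --n-gpu-layers step past the repeating layer count pulls in one more non-repeating tensor, and in both reports the failure lands on the third extra step, n_layer + 3, i.e. the KV tensors.

```cpp
// Hypothetical illustration of the reported breakpoints; not the actual
// llama.cpp offload logic. For Orca 3B (n_layer = 26): 27 and 28 work,
// 29 (the KV step) produces NaNs. For Mistral (n_layer = 32): 34 works,
// 35 fails. Both failures match n_layer + 3.
#include <cstdio>

int main() {
    const int n_layer = 26;                        // repeating layers (Orca 3B)
    for (int ngl = n_layer + 1; ngl <= n_layer + 3; ++ngl) {
        const bool norm   = ngl >= n_layer + 1;    // output norm
        const bool output = ngl >= n_layer + 2;    // output tensor
        const bool kv     = ngl >= n_layer + 3;    // KV: NaNs observed here
        std::printf("ngl=%d norm=%d output=%d kv=%d\n", ngl, norm, output, kv);
    }
    return 0;
}
```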

edit: Actually, it seems like it's okay when compiling without LLAMA_FAST. I still don't know if it only affects ROCM, though.
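LLAMA_FAST builds with -Ofast, which implies -ffast-math; under that flag the compiler may assume NaN and Inf never occur, so NaN-producing math and NaN checks can behave differently than in a normal build. A minimal standalone illustration, unrelated to llama.cpp's actual kernels; compile once with -O2 and once with -Ofast and compare:

```cpp
// Illustrative only: shows how -ffast-math (implied by -Ofast, which
// LLAMA_FAST turns on) changes NaN handling.
#include <cmath>
#include <cstdio>

int main() {
    volatile float zero = 0.0f;   // volatile blocks constant folding
    float x = zero / zero;        // NaN at runtime
    // Under -ffast-math the compiler may assume NaN never occurs and
    // fold this check to false, silently hiding the NaN.
    std::printf("isnan(x) = %d\n", std::isnan(x) ? 1 : 0);
    return 0;
}
```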

cebtenzzre (Collaborator) commented:

> edit: Actually, it seems like it's okay when compiling without LLAMA_FAST. I still don't know if it only affects ROCM, though.

Could be related to #2268 (comment)

slaren added a commit that referenced this pull request Nov 4, 2023
ggerganov pushed a commit that referenced this pull request Nov 5, 2023
* Revert "cuda : add ROCM aliases for CUDA pool stuff (#3918)"

This reverts commit 629f917.

* Revert "cuda : use CUDA memory pool with async memory allocation/deallocation when available (#3903)"

This reverts commit d606905.

ggml-ci
@KerfuffleV2 deleted the fix-rocm-build branch on November 17, 2023
olexiyb pushed a commit to Sanctum-AI/llama.cpp that referenced this pull request Nov 23, 2023