You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I reviewed the Discussions, and have a new bug or useful enhancement to share.
Expected Behavior
I should be able to offload as many layers to the GPU as it can fit when compiled with clblast support and run with flag --n-gpu-layers.
Current Behavior
Since shortly after approx. master-d01bccd (last good version I have) I've been getting asserts when offloading more than ~14 layers when llama.cpp is compiled for clblast.
All ggml .bins (historically) & .gguf models I've tried have the same problem at approx this number of offloaded layers - exact number possibly dependant on the model size. If I offload less they run fine... just slower of course.
Prerequisites
Expected Behavior
I should be able to offload as many layers to the GPU as it can fit when compiled with clblast support and run with flag
--n-gpu-layers
.Current Behavior
Since shortly after approx. master-d01bccd (last good version I have) I've been getting asserts when offloading more than ~14 layers when llama.cpp is compiled for clblast.
All ggml .bins (historically) & .gguf models I've tried have the same problem at approx this number of offloaded layers - exact number possibly dependant on the model size. If I offload less they run fine... just slower of course.
Suspect #2411 is causing the behaviour.
Environment and Context
Failure Information (for bugs)
Steps to Reproduce
Please provide detailed steps for reproducing the issue. We are not sitting in front of your screen, so the more detail the better.
cmake .. -DLLAMA_CLBLAST=ON
cmake --build . --config Release
~/ai/apps/llama.cpp/build/bin/main --interactive-first --multiline-input --instruct --color --threads 8 --n-predict -1 --repeat-last-n -1 --repeat-penalty 1.1 --mirostat 1 --ctx-size 2048 --keep 0 --temp 1.0 --batch-size 2048 --n-gpu-layers 15 --verbose-prompt --model /run/media/user/street/ai/models/llama/gguf/airoboros-l2-13b-2.1.Q8_0.gguf
Failure Logs
The text was updated successfully, but these errors were encountered: