Error in Tabby deployment - llama_cpp_bindings::llama: crates/llama-cpp-bindings/src/llama.rs #1666

mprudra · 2024-03-13T10:16:30Z

Describe the bug
I'm seeing the error below in our Tabby deployment; it looks like a memory error. I don't have any additional logs, since we've modified logging to mask input and output information, which was needed for production deployment.
Process exit code was 1.

cmpl-dc7c656b-2a60-4276-8940-2a578d26e198: Generated 2 tokens in 56.007768ms at 35.709332319759646 tokens/s
cmpl-9c5e112f-5024-4d1b-a7b4-5a3f5dab21c2: Generated 2 tokens in 80.706173ms at 24.781251862853164 tokens/s
2024-03-11T23:00:58.450411Z ERROR llama_cpp_bindings::llama: crates/llama-cpp-bindings/src/llama.rs:78: Failed to step: _Map_base::at

Information about your version
0.5.5

Information about your GPU

+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.154.05             Driver Version: 535.154.05   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
...
...
...
|   3  NVIDIA A100 80GB PCIe          On  | 00000000:E3:00.0 Off |                    0 |
| N/A   44C    P0              74W / 300W |  18141MiB / 81920MiB |      0%   E. Process |
|                                         |                      |             Disabled |
+-----------------------------------------+----------------------+----------------------+

wsxiaoys · 2024-03-13T10:32:41Z

Hi, thanks for reporting the issue. Could you please upgrade to 0.9.0 and see if the problem still persists?

mprudra · 2024-03-13T11:27:02Z

That would require significant effort, so I'll keep it as a last resort.
Do you have any idea what could be causing this error? Is it a known issue in some previous versions?

sergei-dyshel · 2024-03-26T22:05:20Z

Happens for me too, on 0.9.1, when running with delwiv/codefuse-deepseek-33B model. Doesn't happen with TabbyML/DeepseekCoder-6.7B model.

wsxiaoys · 2024-03-26T22:37:24Z

> Happens for me too, on 0.9.1, when running with delwiv/codefuse-deepseek-33B model. Doesn't happen with TabbyML/DeepseekCoder-6.7B model.

Could you also share the log output and your system info?

wsxiaoys · 2024-03-27T00:16:29Z

Seems related:
ggerganov/llama.cpp#3959
ggerganov/llama.cpp#4206

@mprudra could you share the model you were using when encountering the issue?

mprudra · 2024-03-27T16:57:19Z

> Happens for me too, on 0.9.1, when running with delwiv/codefuse-deepseek-33B model. Doesn't happen with TabbyML/DeepseekCoder-6.7B model.
>
> ...
>
> Seems related: ggerganov/llama.cpp#3959 ggerganov/llama.cpp#4206
>
> @mprudra could you share the model you were using when encountering the issue?

~~I'm also using our fine-tuned version of DeepSeekCoder-33b.~~
Correction: I noticed it with the 6.7B model.

mprudra · 2024-03-28T07:07:36Z

Is it the case that Deepseek-Coder models aren't yet supported?
(Deepseek coder merge, ggerganov/llama.cpp#5464)

gyxlucy · 2024-04-04T06:03:51Z

ggerganov/llama.cpp#5981 is the latest issue opened to support deepseek in llama.cpp

wsxiaoys · 2024-11-15T22:43:52Z

Deepseek series models are now supported.

mprudra added the bug-unconfirmed label Mar 13, 2024

wsxiaoys closed this as completed Nov 15, 2024
