
Llama3 Tokenizer #4082

Closed
Bearsaerker opened this issue May 1, 2024 · 4 comments
Labels
bug Something isn't working

Comments

Bearsaerker commented May 1, 2024

What is the issue?

I re-quantized the Llama 3 Sauerkraut fine-tune with the newest release of llama.cpp, which should have fixed the tokenizer, but when I load the model into Ollama I still get the wrong output, while people using llama.cpp directly get the right one. So I'd say there is still something buggy in Ollama. Here is the output:
"What is 7777 + 3333?
Let me calculate that for you!

77,777 (first number) + 33,333 (second number) = 111,110

So the answer is 111,110!"

OS

Linux

GPU

Nvidia

CPU

AMD

Ollama version

0.1.32

Bearsaerker added the bug (Something isn't working) label May 1, 2024

MoonRide303 commented May 1, 2024

Yeah, same here. Freshly converted GGUFs of both the original Llama 3 Instruct and various fine-tunes that give the proper answer to this question in llama.cpp (b2776) or koboldcpp (1.64) fail when imported into Ollama (0.1.32).

[screenshot: the same incorrect answer after importing the GGUF into Ollama]
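
One thing worth checking on a freshly converted GGUF is whether the conversion recorded the pre-tokenizer metadata that the llama.cpp fix introduced at all (a minimal sketch using the gguf Python package from the llama.cpp repo; the file name is just an example, and the exact field-access details may differ between gguf versions):

```python
# pip install gguf   (Python package maintained in the llama.cpp repo)
from gguf import GGUFReader

reader = GGUFReader("Meta-Llama-3-8B-Instruct.Q8_0.gguf")  # example path

# The BPE pre-tokenization fix records which pre-tokenizer regex to use
# under this key; files converted before the fix simply lack it.
field = reader.fields.get("tokenizer.ggml.pre")
if field is None:
    print("tokenizer.ggml.pre missing -- converted before the fix")
else:
    # String fields are stored as raw bytes; data[] indexes the value part.
    value = bytes(field.parts[field.data[0]]).decode("utf-8")
    print("pre-tokenizer:", value)  # something like "llama-bpe" for Llama 3
```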


coder543 commented May 1, 2024

Duplicate? #4026

MoonRide303 commented

@coder543 Yeah, seems to be the same thing.

Bearsaerker (Author) commented

Oh yeah, duplicate. I'm gonna close this then.
