
llama : Added support for SmolLm pre-tokenizer (#8608) #8609

Merged · 5 commits · Jul 22, 2024

Conversation

@Stillerman (Contributor) commented Jul 20, 2024

This PR adds pre-tokenizer support for the SmolLM models.

Added tokenizer type for SmolLM-135M in convert-hf-to-gguf-update.py
Added the chkhsh for SmolLM-135M in convert-hf-to-gguf.py
Added LLAMA_VOCAB_PRE_TYPE_SMOLLM enum to llama.h
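For readers unfamiliar with the update script, the registration step amounts to adding an entry to its model list. The dict below is an illustrative sketch, not copied from the PR; field names follow the script's convention, and only the repo URL is the real SmolLM-135M model page:

```python
# Hypothetical sketch of the kind of entry this PR adds to the `models`
# list in convert_hf_to_gguf_update.py. The field values here are
# illustrative, not copied from the PR diff.
models = [
    {"name": "smollm", "tokt": "BPE",
     "repo": "https://huggingface.co/HuggingFaceTB/SmolLM-135M"},
]

# The update script iterates this list, downloads each tokenizer, and
# regenerates the pre-tokenizer hash table used by convert_hf_to_gguf.py.
for model in models:
    print(model["name"], model["repo"])
```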

Ran ./tests/test-tokenizer-0.sh smollm ./models/ggml-vocab-smollm.gguf and the tests passed.
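For context, the chkhsh mentioned above is a fingerprint of the tokenizer's behavior: the tokenizer encodes a fixed check string, and a hash of the resulting token IDs selects the pre-tokenizer type at conversion time. The sketch below is a simplified illustration of that idea; the helper names and token IDs are made up, and a real run uses the Hugging Face tokenizer for SmolLM-135M:

```python
import hashlib

# Simplified sketch of the pre-tokenizer fingerprint ("chkhsh") idea:
# hash the string form of the token-ID list produced by encoding a
# fixed check string, then look the hash up in a table of known types.
# The IDs and table entry below are illustrative only.
def chkhsh_of(token_ids: list[int]) -> str:
    return hashlib.sha256(str(token_ids).encode()).hexdigest()

# Illustrative table mapping a known hash to a pre-tokenizer name.
known = {chkhsh_of([101, 2023, 2003]): "smollm"}

def detect_pre_type(token_ids: list[int]) -> str:
    # Unrecognized tokenizers fall through to "unknown", which is when
    # a new entry (like this PR's) has to be added.
    return known.get(chkhsh_of(token_ids), "unknown")

print(detect_pre_type([101, 2023, 2003]))
```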

Thank you @m18coppola for #8579

@loubnabnl @anton-l Does src/llama.cpp look right? Are any special settings needed in the tokenizer? The .gguf I created for SmolLM-135M seemed to run inference well.

The github-actions bot added the python (python script changes) label on Jul 20, 2024.
Stillerman and others added 2 commits July 21, 2024 02:48
Co-authored-by: compilade <git@compilade.net>
Co-authored-by: compilade <git@compilade.net>
@ngxson linked an issue on Jul 21, 2024 that may be closed by this pull request.
@Vaibhavs10 (Collaborator) left a comment:

Tested this! Seems to work well for me too!

@ggerganov ggerganov merged commit d94c6e0 into ggerganov:master Jul 22, 2024
55 checks passed
@eliebak commented Jul 22, 2024

Seems to work well for me too @Stillerman, thanks a lot for adding support to our model! 🤗

@9cento commented Jul 26, 2024

The 1.7B model prints out nonsense; is it not supported? Asking for clarification.

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 27, 2024
* Adding SmolLM Pre Tokenizer

* Update convert_hf_to_gguf_update.py

Co-authored-by: compilade <git@compilade.net>

* Update src/llama.cpp

Co-authored-by: compilade <git@compilade.net>

* handle regex

* removed .inp and .out ggufs

---------

Co-authored-by: compilade <git@compilade.net>
Labels: python (python script changes)
Successfully merging this pull request may close: Support for SmolLM
6 participants