Add Qwen2 GGUF loading support #31175
Conversation
Thanks a lot for this great contribution! Can you confirm the other slow tests pass? 🙏 I left a few minor comments, what do you think?
Great work! Thanks for adding Qwen2 support for GGUF files! Can you run the styling checks with `make fixup`?
After that, this PR is ready IMO.
Thanks for adding!
- add qwen2 gguf support
- Update docs
- fix qwen2 tokenizer
- add qwen2 gguf test
- fix typo in qwen2 gguf test
- format code
- Remove mistral, clarify the error message
- format code
- add typing and update docstring
What does this PR do?
This PR uses `model_type` for GGUF tokenizer converter selection instead of `tokenizer_type`. According to `convert-hf-to-gguf.py`, most models register their tokenizer as a `gpt2` tokenizer, so `model_type` is used to select the corresponding tokenizer converter instead of `tokenizer_type`
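The selection logic described above can be sketched as follows; the mapping and converter names here are illustrative placeholders, not the exact identifiers used in `transformers`:

```python
# Sketch: choose a tokenizer converter from the model architecture
# ("model_type") rather than the tokenizer type recorded in the GGUF file.
# A GGUF file may record its tokenizer as "gpt2" even for a Qwen2 model,
# so the architecture name is the reliable key.
GGUF_TO_TOKENIZER_CONVERTER = {
    "llama": "LlamaConverter",
    "qwen2": "Qwen2Converter",
}


def select_gguf_tokenizer_converter(model_type: str) -> str:
    """Return the converter registered for this architecture, or raise."""
    try:
        return GGUF_TO_TOKENIZER_CONVERTER[model_type]
    except KeyError:
        # Per the commit "Remove mistral, clarify the error message",
        # unsupported architectures should fail with a clear error.
        raise ValueError(
            f"Converting a tokenizer for model type `{model_type}` is not supported."
        )


print(select_gguf_tokenizer_converter("qwen2"))
```

With logic like this in place, loading a Qwen2 GGUF checkpoint via `from_pretrained(..., gguf_file=...)` can resolve the right tokenizer converter from the model architecture alone.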
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.