
Add gguf support for StableLM #33793

Merged

Conversation

@VladOS95-cyber (Contributor) commented Sep 29, 2024

What does this PR do?

Add GGUF support for StableLM
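Once merged, the feature could be exercised roughly like this. A minimal sketch of loading a GGUF-quantized StableLM checkpoint through transformers' `gguf_file` argument; the repo id and filename below are illustrative placeholders, not taken from the PR:

```python
# Hypothetical usage sketch of the feature this PR adds: dequantizing a
# GGUF StableLM checkpoint into a regular transformers model.
# GGUF_REPO and GGUF_FILE are placeholders, not real artifacts from the PR.

GGUF_REPO = "stabilityai/stablelm-2-zephyr-1_6b"   # placeholder repo id
GGUF_FILE = "stablelm-2-zephyr-1_6b-Q4_K_M.gguf"   # placeholder filename


def load_dequantized_stablelm(repo_id=GGUF_REPO, gguf_file=GGUF_FILE):
    """Load tokenizer and model from a GGUF file via transformers."""
    # Imported lazily so the module can be inspected without the
    # (heavy) transformers dependency or a network connection.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id, gguf_file=gguf_file)
    model = AutoModelForCausalLM.from_pretrained(repo_id, gguf_file=gguf_file)
    return tokenizer, model


if __name__ == "__main__":
    # Downloads and dequantizes the checkpoint; requires network access.
    tok, model = load_dequantized_stablelm()
    print(model.config.model_type)
```

The `gguf_file` argument dequantizes the GGUF weights back to a full-precision transformers model, so the result can be fine-tuned or re-quantized like any other checkpoint.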

Before submitting

Who can review?

Regarding this task: @SunMarc @LysandreJik @ArthurZucker.

@VladOS95-cyber VladOS95-cyber changed the title add stablelm gguf architecture support Add gguf support for StableLM Sep 29, 2024
@VladOS95-cyber (Contributor, Author)
Hi @SunMarc! This PR is ready for review, please, take a look.

@SunMarc (Member) left a comment

Thanks for adding this, as always, @VladOS95-cyber! Left a few comments.

@VladOS95-cyber force-pushed the add-GGUF-support-for-StableLM branch from c7b1540 to 9c5e33e on October 1, 2024 16:29
@HuggingFaceDocBuilderDev

Hey! 🤗 Thanks for your contribution to the transformers library!

Before merging this pull request, slow tests CI should be triggered. To enable this:

  • Add the run-slow label to the PR
  • When your PR is ready for merge and all reviewers' comments have been addressed, push an empty commit with the command [run-slow] followed by a comma-separated list of all the models to be tested, e.g. [run-slow] model_to_test_1, model_to_test_2
    • If the pull request affects a lot of models, put at most 10 models in the commit message
  • A transformers maintainer will then approve the workflow to start the tests

(For maintainers) The documentation for slow tests CI on PRs is here.
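The empty-commit convention above can be sketched as follows; the model name is illustrative for this PR, and the actual `git` invocation is shown as a comment:

```shell
# Compose the commit message that triggers slow tests.
# "stablelm" is illustrative here; list at most 10 models, comma separated.
MODELS="stablelm"
MSG="[run-slow] ${MODELS}"
echo "${MSG}"
# The actual trigger on the PR branch would then be:
#   git commit --allow-empty -m "${MSG}" && git push
```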

@VladOS95-cyber (Contributor, Author)

Hey @SunMarc! I resolved all comments; could you take a look again?

@SunMarc (Member) left a comment

Thanks for iterating! Could you also add a test that checks the weights of each layer of the fp16 model, to see if we get the same model? We recently merged the GGUF Falcon model, and the author added a nice test for that. Also, there are a few merge conflicts, thanks!

@VladOS95-cyber force-pushed the add-GGUF-support-for-StableLM branch from 9c5e33e to 1d13301 on October 2, 2024 12:52
@VladOS95-cyber (Contributor, Author)

> Thanks for iterating! Could you also add a test that checks the weights of each layer of the fp16 model, to see if we get the same model? We recently merged the GGUF Falcon model, and the author added a nice test for that. Also, there are a few merge conflicts, thanks!

Hey @SunMarc! Sure, it's done.
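A framework-agnostic sketch of the layer-by-layer weight check being requested, using plain float lists in place of tensors (the real test lives in tests/quantization/ggml/test_ggml.py and compares the dequantized fp16 GGUF model against the original checkpoint):

```python
import math


def state_dicts_match(original, converted, tol=1e-6):
    """Compare two {layer_name: flat_weight_list} mappings layer by layer.

    Returns True only when both models expose the same layers and every
    corresponding weight is equal within `tol` -- what an fp16 round-trip
    through GGUF conversion should preserve up to precision.
    """
    if original.keys() != converted.keys():
        return False  # missing or extra layers after conversion
    for name, ref_weights in original.items():
        conv_weights = converted[name]
        if len(ref_weights) != len(conv_weights):
            return False  # shape mismatch in this layer
        if any(not math.isclose(a, b, abs_tol=tol)
               for a, b in zip(ref_weights, conv_weights)):
            return False  # value drift beyond tolerance
    return True
```

In the actual test suite this comparison would run over `model.state_dict()` tensors (e.g. with `torch.allclose`) rather than Python lists; the sketch only illustrates the per-layer structure of the check.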

@SunMarc (Member) left a comment

LGTM! Left a question to better understand.

tests/quantization/ggml/test_ggml.py (comment resolved)
@SunMarc requested a review from LysandreJik on October 2, 2024 14:36
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@VladOS95-cyber (Contributor, Author)

VladOS95-cyber commented Oct 9, 2024

Hi @LysandreJik! Just a kind reminder to take a look at this PR!

@LysandreJik (Member) left a comment

Thanks @VladOS95-cyber, this seems to have slipped through the net. Looks great to me!

@LysandreJik merged commit faa0f63 into huggingface:main on Oct 9, 2024
24 checks passed
NielsRogge pushed a commit to NielsRogge/transformers that referenced this pull request Oct 21, 2024
* add stablelm gguf architecture support

* add additional quantization tests

* resolve merge conflict, add weight conversion tests for fp16
BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024 (same commit message as above)
BernardZach pushed a commit to innovationcore/transformers that referenced this pull request Dec 6, 2024 (same commit message as above)