Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 #33456

jerryzh168 · 2024-09-13T01:03:17Z

Summary:
After huggingface/huggingface_hub#2440 we added non-safetensor serialization and deserialization in huggingface, with this we can now add the support in transformers

Note that we don't plan to add safetensor serialization due to different goals of wrapper tensor subclass and safetensor see README for more details

Test Plan:
tested locally
https://gist.github.com/jerryzh168/965ccdbd595c9210d49cfbe31dc6705f

Reviewers:

Subscribers:

Tasks:

Tags:

jerryzh168 · 2024-09-16T18:18:46Z

cc @SunMarc @Wauplin can you take a look?

SunMarc

Thanks for your work @jerryzh168 to enable serialization ! Really appreciate that you are doing the PRs on huggingface hub and transformers ! This looks pretty good ! I left a few comments

docs/source/en/quantization/torchao.md

src/transformers/modeling_utils.py

src/transformers/quantizers/quantizer_torchao.py

src/transformers/utils/quantization_config.py

src/transformers/quantizers/quantizer_torchao.py

jerryzh168 · 2024-09-18T02:01:24Z

@SunMarc thanks for your thoughtful reviews! I have addressed all the comments I think, please take a look again, also not sure if the CI failure is relevant or not

jerryzh168 · 2024-09-18T02:11:41Z

btw, current pytorch nightly has a perf regression: pytorch/ao#898 and we hope to fix this before 2.5 cherry-pick deadline

SunMarc

Thanks for iterating! LGTM! Just a nit. Let me know when the regression is fixed. I've pinged a core maintainer to review the PR

src/transformers/modeling_utils.py

SunMarc · 2024-09-19T17:19:22Z

To fix the failing test, can you rebase on main ?

HuggingFaceDocBuilderDev · 2024-09-19T17:43:01Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jerryzh168 · 2024-09-24T22:09:50Z

@ArthurZucker can you take a look at the PR?

SunMarc · 2024-09-25T12:41:11Z

I've merged recently PR adding a new quantizer @jerryzh168. Sorry for that but could you rebase on main and update the is_serializable method ?

…nfig quantized model Summary: After huggingface/huggingface_hub#2440 we added non-safetensor serialization and deserialization in huggingface, with this we can now add the support in transformers Note that we don't plan to add safetensor serialization due to different goals of wrapper tensor subclass and safetensor see README for more details Test Plan: tested locally Reviewers: Subscribers: Tasks: Tags:

jerryzh168 · 2024-09-25T22:31:10Z

@SunMarc @ArthurZucker updated, please take a look again

ArthurZucker

Thanks for this PR, super sorry for the delay!
Super important to have serialization!

ArthurZucker · 2024-09-30T09:28:55Z

src/transformers/quantizers/quantizer_torchao.py

-    @property
-    def is_serializable(self):
-        return False
+    def is_serializable(self, safe_serialization=None):


changing the property is a tad breaking, so let's just put the 🔴 on the PR!

ArthurZucker · 2024-09-30T09:31:07Z

Thanks a lot @jerryzh168 🤗 great contributions and I love that we can upload serialized quantized weights to the hub now!

jerryzh168 changed the title ~~Enable non-safetensor serialization and deserialization for TorchAoCo…~~ Enable non-safetensor ser/deser for TorchAoConfig quantized model Sep 13, 2024

jerryzh168 force-pushed the enable-torchao-ser branch from 2a29247 to 8db30bd Compare September 13, 2024 16:26

SunMarc reviewed Sep 16, 2024

View reviewed changes

jerryzh168 mentioned this pull request Sep 18, 2024

[RFC] torchao Contributor Guide pytorch/ao#391

Open

jerryzh168 requested a review from SunMarc September 19, 2024 03:04

SunMarc approved these changes Sep 19, 2024

View reviewed changes

src/transformers/modeling_utils.py Outdated Show resolved Hide resolved

SunMarc requested a review from ArthurZucker September 19, 2024 17:18

jerryzh168 force-pushed the enable-torchao-ser branch from 4f44f40 to dc76271 Compare September 19, 2024 17:23

jerryzh168 force-pushed the enable-torchao-ser branch from f4d7c51 to 93f8c89 Compare September 25, 2024 17:19

jerryzh168 added 10 commits September 25, 2024 15:03

formatting

c333f75

formatting

484754f

minor fix

cfe67e7

formatting

630f700

address comments

2e6d0b0

comments

7111136

minor fix

7b9479d

update doc

dab29c7

refactor compressed tensor quantizer

b9543c8

jerryzh168 force-pushed the enable-torchao-ser branch from 93f8c89 to b9543c8 Compare September 25, 2024 22:03

ArthurZucker approved these changes Sep 30, 2024

View reviewed changes

ArthurZucker changed the title ~~Enable non-safetensor ser/deser for TorchAoConfig quantized model~~ Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 Sep 30, 2024

ArthurZucker merged commit 4bb49d4 into huggingface:main Sep 30, 2024
4 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 #33456

Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 #33456

jerryzh168 commented Sep 13, 2024 •

edited

Loading

jerryzh168 commented Sep 16, 2024

SunMarc left a comment

jerryzh168 commented Sep 18, 2024

jerryzh168 commented Sep 18, 2024

SunMarc left a comment

SunMarc commented Sep 19, 2024

HuggingFaceDocBuilderDev commented Sep 19, 2024

jerryzh168 commented Sep 24, 2024

SunMarc commented Sep 25, 2024 •

edited

Loading

jerryzh168 commented Sep 25, 2024

ArthurZucker left a comment

ArthurZucker Sep 30, 2024

ArthurZucker commented Sep 30, 2024

Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 #33456

Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 #33456

Conversation

jerryzh168 commented Sep 13, 2024 • edited Loading

jerryzh168 commented Sep 16, 2024

SunMarc left a comment

Choose a reason for hiding this comment

jerryzh168 commented Sep 18, 2024

jerryzh168 commented Sep 18, 2024

SunMarc left a comment

Choose a reason for hiding this comment

SunMarc commented Sep 19, 2024

HuggingFaceDocBuilderDev commented Sep 19, 2024

jerryzh168 commented Sep 24, 2024

SunMarc commented Sep 25, 2024 • edited Loading

jerryzh168 commented Sep 25, 2024

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Sep 30, 2024

Choose a reason for hiding this comment

ArthurZucker commented Sep 30, 2024

jerryzh168 commented Sep 13, 2024 •

edited

Loading

SunMarc commented Sep 25, 2024 •

edited

Loading