refactor _replace_linear_8da4w #451
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/451
Note: Links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit 2d4f772 with merge base dee13e1.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Thanks, looks great overall, just a few small nits!
torchao/quantization/GPTQ.py (Outdated)
if copy_weights and child.weight.device != torch.device("meta"):
    new_linear.weight = child.weight
return new_linear
#setattr(module, name, new_linear)
Please delete the commented-out code (this one and the block comment at line 923).
removed
Can you add a summary describing the context for the change?
Please make sure to fill in the Summary and Test Plan, like in #389:
Summary:
Test Plan:
Looks good to me! @jerryzh168 any other comments?
if _check_linear_int4_k(child.in_features, groupsize) or padding_allowed:
    new_linear = linear_class(
...
#import the util function here to avoid circular dependency
nit: please add a space between # and import
#import the util function here to avoid circular dependency
from torchao.quantization.quant_api import _replace_with_custom_fn_if_matches_filter

def filter_fn(child: torch.nn.Module, cur_fqn:str) -> bool:
nit: spacing cur_fqn: str
LGTM, thanks for addressing all the comments
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
@huydhn @clee2000 @jerryzh168 is it deliberate that we can no longer merge PRs manually? Was this to enable ghstack? I'm not sure the tradeoff was worth it, since most contributors are not using ghstack. I deliberately removed the requirement that the branch be up to date and authorized merging for anyone who is a maintainer, which makes merging contributor code simpler. Right now I'm just waiting on the bot to merge something that should be merged immediately. The required checks don't include any tests either; they are only some internal FB checks. EDIT: This seems to be an unrelated bug with branch protection rules internally; I am following up.
I'm not sure what happened; ghstack changes shouldn't affect normal landing, I think, so this is unexpected. @huydhn do you know what is happening here?
* refactor _replace_linear_8da4w
* clean up version
Summary:
Reimplement the _replace_linear_8da4w function using the more general utility function _replace_with_custom_fn_if_matches_filter from torchao.quantization.quant_api, reducing duplication of similar module-replacement logic.
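For reference, the general utility mentioned above works roughly like the sketch below. This is a minimal illustration of the traversal pattern, not the exact torchao implementation, and the function name here is hypothetical: it walks the module tree recursively and, for every submodule whose (module, fully qualified name) pair passes filter_fn, swaps the submodule out with the result of replacement_fn.

```python
import torch

def replace_if_matches_filter(model, replacement_fn, filter_fn, cur_fqn=""):
    # If the current module matches the filter, replace it outright.
    if filter_fn(model, cur_fqn[:-1]):
        return replacement_fn(model)
    # Otherwise recurse into the children and re-attach any child that was replaced.
    for name, child in model.named_children():
        new_child = replace_if_matches_filter(
            child, replacement_fn, filter_fn, f"{cur_fqn}{name}."
        )
        if new_child is not child:
            setattr(model, name, new_child)
    return model
```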
Test Plan:
python test/quantization/test_quant_api.py
python test/quantization/test_quant_primitives.py
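For illustration, the refactored function could look roughly like the sketch below, stitched together from the diff excerpts quoted in the review above. The signature, the linear_class constructor arguments, and the _check_linear_int4_k stand-in are assumptions made to keep the example self-contained, not the exact code that landed.

```python
import torch

def _check_linear_int4_k(k: int, groupsize: int) -> bool:
    # Stand-in for the helper used in GPTQ.py: checks that in_features
    # divide evenly into the quantization groupsize.
    return k % groupsize == 0

def replace_linear_8da4w_sketch(module, linear_class, groupsize,
                                padding_allowed=False, copy_weights=False):
    # import the util function here to avoid a circular dependency
    from torchao.quantization.quant_api import (
        _replace_with_custom_fn_if_matches_filter,
    )

    def filter_fn(child: torch.nn.Module, cur_fqn: str) -> bool:
        # Replace only plain nn.Linear layers whose in_features are compatible
        # with the groupsize (or when padding to a compatible size is allowed).
        return isinstance(child, torch.nn.Linear) and (
            _check_linear_int4_k(child.in_features, groupsize) or padding_allowed
        )

    def replacement_fn(child: torch.nn.Module) -> torch.nn.Module:
        # Constructor arguments are illustrative; the real linear_class in
        # GPTQ.py takes additional precision/scale arguments.
        new_linear = linear_class(
            child.in_features,
            child.out_features,
            bias=False,
            device=child.weight.device,
            groupsize=groupsize,
        )
        # Copy weights only when they are real tensors, not "meta" placeholders.
        if copy_weights and child.weight.device != torch.device("meta"):
            new_linear.weight = child.weight
        return new_linear

    return _replace_with_custom_fn_if_matches_filter(module, replacement_fn, filter_fn)
```

The point of the refactor is that the recursive traversal and re-attachment logic lives in one place (the shared utility), while _replace_linear_8da4w only supplies the filter and the replacement.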