Misc bug fixes in Zero optimizer: handling differentiable argument, optimizer_dtype #6454

amithrm · 2024-02-02T00:21:54Z

This a cumulative PR with misc bug fixes and updates to Zero Redundancy Optimizer from all the authors (AWS): Guangtai Huang, Rahul Solanki, Fei Wu, Amith Mamidala

jeffhataws · 2024-02-03T18:31:21Z

torch_xla/distributed/zero_redundancy_optimizer.py

+    # Here we pop the differentiable default because the adam family of
+    # optimizers don't have differentiable as an argument. This should
+    # be fixed by this commit https://github.com/pytorch/pytorch/pull/86183
+    # and should be available in torch==2.0. For 1.13, we are patching it here.


Hi @amithrm , here it says "This should be fixed by this commit pytorch/pytorch#86183 and should be available in torch==2.0." Can you remove this patch?

@jeffhataws PTAL

…ptimizer_dtype

alanwaketan

Can we have a test case to cover the change?

jeffhataws · 2024-03-11T16:13:17Z

torch_xla/distributed/zero_redundancy_optimizer.py

-              pin_layout=self.pin_layout,
-              groups=self.sharding_groups,
-          )
+          sharded_data.append(shard_data)


Is this gathering all the parameters into one bucket?

jeffhataws · 2024-03-21T22:46:36Z

The changes here should already be in #6025 , as confirmed by Guangtai.

amithrm changed the title ~~Fixing bug~~ Misc bug fixes in Zero optimizer: handling differentiable argument, optimizer_dtype Feb 2, 2024

amithrm force-pushed the zero1 branch from 5a61ef1 to 660a3c7 Compare February 2, 2024 23:18

jeffhataws reviewed Feb 3, 2024

View reviewed changes

Misc bug fixes in Zero optimizer: handling differentiable argument, o…

1acaf4b

…ptimizer_dtype

amithrm force-pushed the zero1 branch from 660a3c7 to 1acaf4b Compare March 2, 2024 00:10

Misc bug fixes and optimizations in Zero optimizer

cc2acd1

jeffhataws requested review from JackCaoG and alanwaketan March 4, 2024 18:51

alanwaketan reviewed Mar 4, 2024

View reviewed changes

jeffhataws reviewed Mar 11, 2024

View reviewed changes

jeffhataws closed this Mar 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Misc bug fixes in Zero optimizer: handling differentiable argument, optimizer_dtype #6454

Misc bug fixes in Zero optimizer: handling differentiable argument, optimizer_dtype #6454

amithrm commented Feb 2, 2024 •

edited

Loading

jeffhataws Feb 3, 2024

amithrm Mar 1, 2024

amithrm Mar 2, 2024

alanwaketan left a comment

jeffhataws Mar 11, 2024

jeffhataws commented Mar 21, 2024

Misc bug fixes in Zero optimizer: handling differentiable argument, optimizer_dtype #6454

Misc bug fixes in Zero optimizer: handling differentiable argument, optimizer_dtype #6454

Conversation

amithrm commented Feb 2, 2024 • edited Loading

jeffhataws Feb 3, 2024

Choose a reason for hiding this comment

amithrm Mar 1, 2024

Choose a reason for hiding this comment

amithrm Mar 2, 2024

Choose a reason for hiding this comment

alanwaketan left a comment

Choose a reason for hiding this comment

jeffhataws Mar 11, 2024

Choose a reason for hiding this comment

jeffhataws commented Mar 21, 2024

amithrm commented Feb 2, 2024 •

edited

Loading