[SPMD] Support manual sharding #6915

Merged
merged 7 commits on Apr 12, 2024

Conversation

@alanwaketan alanwaketan commented Apr 10, 2024

Summary:
This pull request makes SPMD support the manual sharding type via a new private API called _mark_manual_sharding. I don't expect users will need to call this function explicitly.

Besides adding support for the sharding annotation, we also need to define the behavior of the data shards. For data, the current behavior is to error out.

Test Plan:
PJRT_DEVICE=TPU python test/spmd/test_xla_sharding.py -v -k test__mark_manual_sharding
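
For context, a minimal sketch assembled from the snippets quoted in the review thread below. The import aliases are assumptions for illustration, and how plain device data is handled is exactly what the thread discusses (per the summary, the current behavior for data is to error out):

    import torch
    import torch_xla
    import torch_xla.core.xla_model as xm
    import torch_xla.distributed.spmd as xs  # assumed alias for the sharding API

    # Device-data tensor, as in the reviewer's example further down.
    xx = torch.randn(3, 2).to(xm.xla_device())

    # New private API added by this PR: annotate the tensor with the MANUAL
    # sharding type. Users are not expected to call this directly.
    xt = xs._mark_manual_sharding(xx)

    # The annotation surfaces in the lowered HLO; the unit test checks for
    # 'sharding={manual}' in this dump.
    hlo = torch_xla._XLAC._get_xla_tensors_hlo([xt.global_tensor])
    print(hlo)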

@alanwaketan alanwaketan requested review from yeounoh and jonb377 April 10, 2024 23:08
@alanwaketan alanwaketan self-assigned this Apr 10, 2024
@jonb377 jonb377 (Collaborator) left a comment

Interesting! I've been wondering what manual sharding is for.

    // The returned tensors will be in 1:1 correspondence with the `devices`
    // vector, so the `i`th result will belong on the `i`th device.
    // the `tile_assignment`; MANUAL sharding results in shards where only the
    // first device holds the full data; the returned tensor shards vector is
Collaborator
only the first device holds the full data

Is this by definition of manual sharding?

@yeounoh yeounoh (Contributor) Apr 11, 2024

This is not by definition, but an implementation choice on our side. A better example would be a list of tensors (as in DTensor), where each tensor is an individual full shard.

Collaborator Author

@yeounoh Will that be replicated then?

Contributor

Per our offline discussion, we'll abstain from manual sharding on input data.

    result.reserve(cpu_shards.size() / shards_per_tensor);
    for (int i = 0; i < cpu_shards.size(); i += shards_per_tensor) {
      std::vector<at::Tensor> cpu_shards =
          XlaDataToTensors(WrapXlaData(shard_handles), element_types);
Collaborator

Calling XlaDataToTensors on each tensor individually will slow down d2h transfers for async checkpointing, since PjRt won't be able to fully utilize transfer parallelization.

Do we expect manually-sharded tensors to contain actual device data generally, or will they usually be IR? If just IR, maybe we can add an assertion to prevent access here.
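
Purely to illustrate the batching idea, a hypothetical sketch reusing the names from the hunk above; per_tensor_shard_handles and all_element_types are made-up placeholders, not code from this PR:

    // Hypothetical: gather every shard handle up front, then make a single
    // XlaDataToTensors call so PjRt can overlap the device-to-host copies,
    // instead of transferring per logical tensor inside the loop.
    decltype(shard_handles) all_shard_handles;
    for (const auto& handles : per_tensor_shard_handles) {
      all_shard_handles.insert(all_shard_handles.end(), handles.begin(),
                               handles.end());
    }
    std::vector<at::Tensor> all_cpu_shards =
        XlaDataToTensors(WrapXlaData(all_shard_handles), all_element_types);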

Contributor

I'd rather keep it functional for both cases -- shouldn't it be asynchronous anyway, not blocking the actual training run?

Collaborator Author

This is interesting. I was not aware of this performance optimization...

    } else if ((sharding.type() == xla::OpSharding::MANUAL)) {
      // Just put the full tensor on the first device.
      shards[0] = tensor;
      shards.resize(1);
@jonb377 jonb377 (Collaborator) Apr 10, 2024

How does this work for a computation, since we need to feed each device some input data?

e.g. based on your unit test, what happens if we run:

    x = torch.randn(3, 2)
    xx = x.to(xm.xla_device())  # xx is device data
    xt = xs._mark_manual_sharding(xx)

    ones = torch.ones(3, 2).to(xm.xla_device()) # ones is replicated to all devices
    print(xt + ones)  # What will happen here?

Contributor

XLA should assume that xt is sharded manually, so it's expected to be replicated as well. The purpose of MANUAL is to support custom kernels and to prevent XLA from overriding the manual sharding.

Collaborator Author

Good question. I would expect it to behave as it would on a single device. Let me double-check as well.

    xt = xs._mark_manual_sharding(xx)

    hlo = torch_xla._XLAC._get_xla_tensors_hlo([xt.global_tensor])
    self.assertIn('parameter(0), sharding={manual}', hlo)
Contributor

Great!

@yeounoh yeounoh (Contributor) left a comment

LGTM. I'll leave the correctness review of distributed checkpointing with manual sharding to @jonb377 and his unit tests.

    @@ -1100,6 +1100,26 @@ def test_global_mesh(self):

        self.assertEqual(id(mesh), id(expected_mesh))

      def test__mark_manual_sharding(self):
Contributor

nit: even though it's testing the _-prefixed API, let's keep the name as test_mark_manual_sharding.

@alanwaketan (Collaborator Author)

Here is the new TPU CI run: https://github.com/pytorch/xla/actions/runs/8652176761

@alanwaketan (Collaborator Author)

All tests passed. I'm going to merge it. Let me know if I need to follow up on anything.

@alanwaketan alanwaketan merged commit e5513ff into master Apr 12, 2024
20 checks passed
lausannel pushed a commit to AlibabaPAI/xla that referenced this pull request Aug 6, 2024
baoleai pushed a commit to AlibabaPAI/xla that referenced this pull request Aug 6, 2024