Remove obsolete device special cases #6197

Merged: 11 commits merged into master on Jan 3, 2024

Conversation

will-cromar (Collaborator)

No description provided.

@will-cromar marked this pull request as ready for review on December 18, 2023 at 22:25
@will-cromar changed the title from "[WIP] Remove obsolete device special cases" to "Remove obsolete device special cases" on Dec 18, 2023
@will-cromar requested a review from JackCaoG on December 18, 2023 at 22:25
@JackCaoG (Collaborator) left a comment


Thanks! Can you add tests for U16, F64, etc. on TPU CI to make sure they are actually natively supported?

@will-cromar (Collaborator, Author)

> Thanks! Can you add tests for U16, F64, etc. on TPU CI to make sure they are actually natively supported?

I tested this manually, but I'll make sure there's a unit test for each one.

@will-cromar (Collaborator, Author)

> Thanks! Can you add tests for U16, F64, etc. on TPU CI to make sure they are actually natively supported?

Added a test that iterates through the dtypes that both torch and XLA support (a rough sketch of its shape follows the options below).

I found the list of dtypes supported on TPU internally; it does not include complex128, so using that dtype on TPU will trigger an error. The others are all supported now. We have two options for complex128:

  1. Change our behavior to throw an error for unsupported dtypes. IMO this is the safer behavior. Plus, SPMD obscures the actual device type, which makes it hard to determine what the "real" device is without accessing runtime state.
  2. Add downcasting back for complex128 only.
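
For reference, a minimal sketch of what such a parameterized round-trip test might look like (the real test lives in test/pjrt/test_dtypes.py per the traceback later in this thread, but its exact contents are not shown here, so the dtype list and structure below are assumptions):

    # Hypothetical sketch of the dtype round-trip test discussed above.
    from absl.testing import absltest, parameterized
    import torch
    import torch_xla.core.xla_model as xm


    class TestDtypes(parameterized.TestCase):

      @parameterized.parameters(torch.float16, torch.bfloat16, torch.float32,
                                torch.float64, torch.complex64)
      def test_float_round_trip(self, dtype):
        # Move a tensor to the XLA device and back; the values only survive
        # unchanged if the dtype is handled natively rather than silently
        # downcast.
        t = torch.randn(3, 3).to(dtype)
        xt = t.to(xm.xla_device())
        torch.testing.assert_close(xt.cpu(), t)


    if __name__ == '__main__':
      absltest.main()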

@JackCaoG (Collaborator) commented Jan 2, 2024

Yeah... I don't know how many people actually use complex128. Is it supported on GPU? As we are increasing our effort on GPU this year, I don't want to regress PyTorch/XLA:GPU.

@will-cromar (Collaborator, Author)

> Yeah... I don't know how many people actually use complex128. Is it supported on GPU? As we are increasing our effort on GPU this year, I don't want to regress PyTorch/XLA:GPU.

This PR won't affect behavior on GPU. I'm basically just removing the downcast when creating tensors on TPUs, and we never changed the dtype on GPU. If the underlying XLA:GPU client doesn't support complex128 (it probably does), then it would just throw an error.

@@ -163,8 +153,7 @@ xla::PrimitiveType MaybeDowncastToXlaDeviceType(
   if (UseBF16()) {
     return xla::PrimitiveType::BF16;
   }
-  if (DowncastBF16() || DowncastF16() || IsTpuDevice(hw_type) ||
-      hw_type == XlaDeviceType::NEURON) {
+  if (DowncastBF16() || DowncastF16() || hw_type == XlaDeviceType::NEURON) {
Collaborator

@jeffhataws do you know what dtypes the Neuron device does not support?

@will-cromar in the long term, I think we might want to come up with a mechanism for each backend to register which dtypes they support and how they want to map PyTorch types to XLA types.

@will-cromar (Collaborator, Author)

> @will-cromar in the long term, I think we might want to come up with a mechanism for each backend to register which dtypes they support and how they want to map PyTorch types to XLA types.

Yeah, if we keep this downcasting behavior, I will consolidate it in the DevicePlugin API (see #6242)
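
For illustration only, a rough sketch of what such a per-backend hook could look like if it were layered on the DevicePlugin API from #6242; the `supported_dtypes` method, the plugin name, and the library path below are all hypothetical, not existing API:

    # Hypothetical sketch: `supported_dtypes` is not part of the existing
    # DevicePlugin API; it only illustrates the kind of registration hook
    # discussed above.
    import torch
    from torch_xla.experimental import plugins


    class MyAcceleratorPlugin(plugins.DevicePlugin):
      def library_path(self) -> str:
        # Path to this backend's PJRT plugin binary (placeholder path).
        return "/path/to/pjrt_plugin_my_accelerator.so"

      def supported_dtypes(self):
        # Hypothetical hook: dtypes this backend handles natively; a common
        # code path could downcast or reject anything else.
        return {torch.float32, torch.bfloat16, torch.int32, torch.int64}


    # Registration as in the plugin API (assumed usage).
    plugins.register_plugin("MY_ACCELERATOR", MyAcceleratorPlugin())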

Collaborator


Yeah, it is best to keep fp32 since it is currently supported, as described in https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/arch/neuron-features/data-types.html.

@will-cromar (Collaborator, Author)

Hmm, both CPU and GPU fail the test I added with complex128:

Traceback (most recent call last):
  File "/opt/conda/lib/python3.8/site-packages/absl/testing/parameterized.py", line 323, in bound_param_test
    return test_method(self, testcase_params)
  File "/tmp/pytorch/xla/test/pjrt/test_dtypes.py", line 22, in test_float_round_trip
    torch.testing.assert_close(xt.cpu(), t)
  File "/opt/conda/lib/python3.8/site-packages/torch/testing/_comparison.py", line 1520, in assert_close
    raise error_metas[0].to_error(msg)
AssertionError: Tensor-likes are not close!

Mismatched elements: 9 / 9 (100.0%)
Greatest absolute difference: 1.3942428785896892 at index (2, 1) (up to 1e-07 allowed)
Greatest relative difference: 0.9994087028336766 at index (1, 0) (up to 1e-07 allowed)

I'll just remove this test case for now.

@will-cromar merged commit 0cd6f10 into master on Jan 3, 2024
21 checks passed
mbzomowski pushed a commit to mbzomowski-test-org/xla that referenced this pull request Jan 3, 2024
bhavya01 pushed a commit that referenced this pull request Apr 22, 2024