
Automatically move CUDA non XLA Tensors to XLA Device and back to CUDA device #6644

Merged: 3 commits merged from changm/automove into master on Mar 13, 2024

Conversation

changm (Collaborator) commented on Feb 29, 2024:

This currently only works for inference; the assumptions don't hold yet for training with Autograd.
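
For orientation, here is a minimal sketch of the inference-only flow this change targets, assuming the openxla dynamo backend; the function and shapes below are illustrative and not taken from the PR:

import torch

# Illustrative user code; the PR itself changes torch_xla/core/dynamo_bridge.py.
# Requires a torch_xla build with CUDA support.
def matmul_add(a, b):
  return a @ b + b

compiled = torch.compile(matmul_add, backend="openxla")

a = torch.randn(4, 4, device="cuda")
b = torch.randn(4, 4, device="cuda")

# Inference only: the CUDA inputs are moved to the XLA device before the
# compiled graph runs, and the output is moved back to the original CUDA device.
with torch.no_grad():
  out = compiled(a, b)

print(out.device)  # expected: cuda:0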

changm self-assigned this on Feb 29, 2024.
JackCaoG (Collaborator) commented on Mar 7, 2024:

Is this an experimental PR, or do you want to merge it?

changm (Collaborator, Author) commented on Mar 7, 2024:

Is this an experimental PR, or do you want to merge it?

Ideally we would merge this, or is there a reason not to?

changm changed the title from "Automatically move non XLA Tensors to XLA Device and back to original device" to "Automatically move CUDA non XLA Tensors to XLA Device and back to CUDA device" on Mar 11, 2024.
Review comment on torch_xla/core/dynamo_bridge.py:

@@ -387,6 +446,12 @@ def optimized_mod(*args):
  nonlocal xla_args_need_update
  nonlocal skip_checking_input_sharding_threashold

  original_device: torch.device = _get_input_arg_device(args)
  is_cuda_args: bool = _args_on_cuda(args)
A reviewer (Collaborator) commented:

_args_on_cuda will call _get_input_arg_device, which is redundant.

changm (Author) replied:

I still think it's a little cleaner to do the redundant call, but removed the call here.
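
To illustrate the point being discussed, a hedged sketch follows; the helper bodies are approximations written for this note, not the actual dynamo_bridge.py implementations, and _classify_args is a hypothetical name for the resolved pattern:

from typing import Optional, Sequence

import torch

def _get_input_arg_device(args: Sequence) -> Optional[torch.device]:
  # Return the device of the first tensor argument, if any (illustrative body).
  for arg in args:
    if isinstance(arg, torch.Tensor):
      return arg.device
  return None

def _args_on_cuda(args: Sequence) -> bool:
  # Walks the args again via _get_input_arg_device -- the redundancy the
  # reviewer pointed out when the caller has already fetched the device.
  device = _get_input_arg_device(args)
  return device is not None and device.type == "cuda"

# Resolved pattern (hypothetical helper name): derive the flag from the device
# that was already fetched, so the argument list is scanned only once.
def _classify_args(args: Sequence):
  original_device = _get_input_arg_device(args)
  is_cuda_args = original_device is not None and original_device.type == "cuda"
  return original_device, is_cuda_args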

Three further review threads on torch_xla/core/dynamo_bridge.py were marked resolved.
changm merged commit d13ae1b into master on Mar 13, 2024; 18 checks passed.
changm deleted the changm/automove branch on March 13, 2024 at 16:41.
vanbasten23 (Collaborator) commented:

Sorry for being late. It's looking good!

yitongh pushed a commit to AlibabaPAI/xla that referenced this pull request on Oct 11, 2024.