
Adapt a few torch.utils.checkpoint functions for PyTorch/XLA. #6178

Merged
merged 4 commits into from
Jan 10, 2024

Conversation

ysiraichi
Copy link
Collaborator

Fix: #6086

This PR re-implements the get_device_states and set_device_states functions, used in CheckpointFunction, so that they work with PyTorch/XLA. Previously, they weren't a problem, since PyTorch was usually compiled without CUDA support.
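For context, these two functions implement the RNG save/restore pattern that activation checkpointing relies on: the forward pass records the RNG state of every device touched by the inputs, and the recompute pass restores those states so that random ops (e.g. dropout) replay identically. The sketch below illustrates that pattern using only the stdlib random module as a stand-in for per-device CUDA/XLA RNG state; the function names mirror torch.utils.checkpoint, but this is a simplified illustration, not the actual implementation.

```python
import random

def get_device_states():
    # Stand-in for torch.utils.checkpoint.get_device_states: snapshot the
    # RNG state so the recompute pass can replay the same random numbers.
    # In the real code this collects one state per CUDA (or XLA) device
    # touched by the checkpointed inputs.
    return random.getstate()

def set_device_states(state):
    # Stand-in for set_device_states: restore the snapshot before
    # re-running the forward pass inside backward.
    random.setstate(state)

# First (real) forward pass: save state, then run random ops.
state = get_device_states()
first = [random.random() for _ in range(3)]

# Recompute pass: restore state, re-run, and get identical values.
set_device_states(state)
replayed = [random.random() for _ in range(3)]
assert first == replayed
```

The PyTorch/XLA adaptation follows the same shape, but has to query XLA device RNG state instead of calling into torch.cuda, which fails (or silently does the wrong thing) when PyTorch is compiled with CUDA support but the tensors live on XLA devices.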

cc @JackCaoG @miladm

@JackCaoG
Copy link
Collaborator

lol, I copy-pasted this file mostly from upstream, hoping we can merge it back one day. Do you know the correct way of extending the checkpoint module?

I copied this file because we need optimization_barrier_

@ysiraichi
Copy link
Collaborator Author

I have no idea. I just fixed the parts where device_module was needed, so that XLA would run successfully.

@JackCaoG
Copy link
Collaborator

LGTM, though I am not sure how it is different from the upstream one. Can you leave a comment for these two functions?

@ysiraichi ysiraichi force-pushed the ysiraichi/fix-checkpoint branch from 85ceb01 to 3d0d771 Compare January 8, 2024 19:36
@ysiraichi ysiraichi merged commit ebb200b into master Jan 10, 2024
20 checks passed
Successfully merging this pull request may close these issues.

test_grad_checkpoint.py fails if PyTorch is compiled with CUDA support.