Add data-type promotion to `stack`. #7091

ysiraichi · 2024-05-22T00:42:22Z

This PR adds data-type promotion to stack operation. Previously, there was none. So, the kernel implicitly expected the arguments to be of the same data-type. This might not be the case when using AMP.

cc @miladm @JackCaoG

JackCaoG · 2024-05-22T00:54:15Z

torch_xla/csrc/aten_xla_type.cpp

@@ -3158,8 +3158,12 @@ at::Tensor XLANativeFunctions::squeeze_copy(const at::Tensor& self,

 at::Tensor XLANativeFunctions::stack(at::TensorList tensors, int64_t dim) {
  TORCH_LAZY_FN_COUNTER_TIMED_TRACING("xla::");
+  at::ScalarType result_type = at::native::result_type(tensors);
+  std::vector<at::Tensor> c_tensors(tensors.size());


is stack expecting input tensor to be CPU? std::vector<at::Tensor> c_tensors will return a list of tenosrs on CPU right?

I don't think so. Unless I'm missing something, they are casted tensors, on XLA.

Then I am abit confused. Reading your code, you init the c_tensors vector which I assume they will be cpu tensors since you didn;t provide the device type. In the later code you only update the dtype of these c_tensors, I don't know when are they moved to the XLA device.

Here's a summary of what this code is doing: considering the arguments tensors (a list of XLA tensors) and dim, the function:

Computes the common data-type of all tensors: result_type

Converts each tensor to the common data-type, storing the result in c_tensors (as in "cast tensors")

Calls tensor_methods::stack with the casted tensors

Oh I see. transform is called with tensors.begin()..

ysiraichi added the xla:gpu label May 22, 2024

ysiraichi requested a review from JackCaoG May 22, 2024 00:42

JackCaoG reviewed May 22, 2024

View reviewed changes

ysiraichi added 2 commits May 22, 2024 11:18

Add test.

af63bd8

Add dtype promotion to stack.

5fbcdd9

ysiraichi force-pushed the ysiraichi/fix-stack-dtype-promotion branch from 57352fb to 5fbcdd9 Compare May 22, 2024 14:18

ysiraichi requested a review from JackCaoG May 22, 2024 18:54

JackCaoG approved these changes May 22, 2024

View reviewed changes

ysiraichi mentioned this pull request May 22, 2024

upsample_bilinear2d HLO returns unexpected data-type. #7095

Closed

ysiraichi merged commit a299f33 into master May 23, 2024
20 checks passed

ysiraichi mentioned this pull request May 25, 2024

Failing Torchbench Models: tracking issue #5932

Open

qihqi pushed a commit that referenced this pull request May 29, 2024

Add data-type promotion to stack. (#7091)

157f8f9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add data-type promotion to `stack`. #7091

Add data-type promotion to `stack`. #7091

ysiraichi commented May 22, 2024

JackCaoG May 22, 2024

ysiraichi May 22, 2024 •

edited

Loading

JackCaoG May 22, 2024

ysiraichi May 22, 2024

JackCaoG May 22, 2024

Add data-type promotion to stack. #7091

Add data-type promotion to stack. #7091

Conversation

ysiraichi commented May 22, 2024

JackCaoG May 22, 2024

Choose a reason for hiding this comment

ysiraichi May 22, 2024 • edited Loading

Choose a reason for hiding this comment

JackCaoG May 22, 2024

Choose a reason for hiding this comment

ysiraichi May 22, 2024

Choose a reason for hiding this comment

JackCaoG May 22, 2024

Choose a reason for hiding this comment

Add data-type promotion to `stack`. #7091

Add data-type promotion to `stack`. #7091

ysiraichi May 22, 2024 •

edited

Loading