Introduce fast path in the CPU equal op #100024

houseroad · 2023-04-25T20:52:09Z

Summary: When two tensors share the same storage, and strides, and no other flags, then we should consider this tensors as equal.

Test Plan: buck2 test @//mode/opt //caffe2/test:torch -- --exact 'caffe2/test:torch - test_equal (test_torch.TestTorch)'

Differential Revision: D45282119

pytorch-bot · 2023-04-25T20:52:11Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/100024

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3e92a89:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2023-04-25T20:53:02Z

This pull request was exported from Phabricator. Differential Revision: D45282119

albanD · 2023-04-25T20:55:19Z

aten/src/ATen/native/ReduceOps.cpp

+  // ensuring the storage and strides exactly the same.
+  if (self.sizes().equals(other.sizes())
+      && self.strides().equals(other.strides())
+      && self.storage().is_alias_of(other.storage())


Since storage is untyped, you should check the the dtype() matches here as well.
A good test for this would be:

import torch a = torch.rand((2, 2), dtype=torch.float) b = a.view(dtype=torch.int32) print(torch.equal(a, b))

Good point. Added

facebook-github-bot · 2023-04-25T23:28:36Z

This pull request was exported from Phabricator. Differential Revision: D45282119

facebook-github-bot · 2023-04-25T23:50:34Z

This pull request was exported from Phabricator. Differential Revision: D45282119

facebook-github-bot · 2023-04-26T00:08:10Z

This pull request was exported from Phabricator. Differential Revision: D45282119

facebook-github-bot · 2023-04-26T16:40:51Z

This pull request was exported from Phabricator. Differential Revision: D45282119

facebook-github-bot · 2023-04-26T16:50:45Z

This pull request was exported from Phabricator. Differential Revision: D45282119

ezyang · 2023-04-26T18:53:49Z

aten/src/ATen/native/ReduceOps.cpp

+      && self.storage_offset() == other.storage_offset()
+      && self.layout() == other.layout()
+      && self.is_neg() == other.is_neg()
+      && self.is_conj() == other.is_conj()) {


You don't need to test these three, they will have been handled before getting here.

Just to be safe, I am a bit concerned in some cases, if users directly this cpu_equal function directly, although this should be rare. Keeping these checks shouldn't hurt, or we concern about the overhead for these calls?

ezyang · 2023-04-26T18:53:57Z

aten/src/ATen/native/ReduceOps.cpp

+  // TensorIterator, it should be safe to have the following fast path by
+  // ensuring the storage and strides exactly the same.
+  if (self.dtype() == other.dtype()
+      && self.sizes().equals(other.sizes())


You don't need to test this, it's tested above

ezyang · 2023-04-26T18:54:19Z

aten/src/ATen/native/ReduceOps.cpp

+  // ensuring the storage and strides exactly the same.
+  if (self.dtype() == other.dtype()
+      && self.sizes().equals(other.sizes())
+      && self.strides().equals(other.strides())


A fastpath for this would be to instead assert both tensors are contiguous, before checking their strides

ezyang

You might also want this to apply to cuda too.

houseroad · 2023-04-26T19:44:02Z

Yeah, I will handle the CUDA one in the following PR.

facebook-github-bot · 2023-04-26T20:48:29Z

This pull request was exported from Phabricator. Differential Revision: D45282119

facebook-github-bot · 2023-04-26T20:58:22Z

This pull request was exported from Phabricator. Differential Revision: D45282119

facebook-github-bot · 2023-04-27T00:37:05Z

This pull request was exported from Phabricator. Differential Revision: D45282119

facebook-github-bot · 2023-04-27T00:47:01Z

This pull request was exported from Phabricator. Differential Revision: D45282119

Summary: Pull Request resolved: pytorch#100024 When two tensors share the same storage, and strides, and no other flags, then we should consider this tensors as equal. We have another approach in pytorch#99703, which is directly check equality in the JIT loader. However, we may have to handle the flags like neg/conj explicitly. It's a bit hard to cover all the cases. Per discussion with davidberard98, in the flags like neg/conj should be handled by the dispatcher already (and the TensorIterator logic also proves this), so adding the fast path to CPU and CUDA ops should be a better/safer approach. Test Plan: buck2 test @//mode/opt //caffe2/test:torch -- --exact 'caffe2/test:torch - test_equal (test_torch.TestTorch)' Reviewed By: hyuen Differential Revision: D45282119 fbshipit-source-id: 18e939d236a6d84a79013317db8b2f715f4a3cff

facebook-github-bot · 2023-04-27T00:57:41Z

This pull request was exported from Phabricator. Differential Revision: D45282119

`torch.equal(x, x)` should return false if one of `x` is a tenor of floating point values one of which could be NaN So, it renders some of the optimization proposed in #100024 invalid. Add regression test that calls torch.equal for tensor containing NaN Fixes #111251

`torch.equal(x, x)` should return false if one of `x` is a tenor of floats one of which is NaN. So, it renders some of the optimization proposed in #100024 invalid, though as result `torch.equal` will become much slower for identical floating point tensors. Add regression test that calls torch.equal for tensor containing NaN Fixes #111251 Pull Request resolved: #111699 Approved by: https://github.com/Skylion007, https://github.com/albanD

`torch.equal(x, x)` should return false if one of `x` is a tenor of floats one of which is NaN. So, it renders some of the optimization proposed in #100024 invalid, though as result `torch.equal` will become much slower for identical floating point tensors. Add regression test that calls torch.equal for tensor containing NaN Fixes #111251 Pull Request resolved: #111699 Approved by: https://github.com/Skylion007, https://github.com/albanD (cherry picked from commit 7709382)

`torch.equal(x, x)` should return false if one of `x` is a tenor of floats one of which is NaN. So, it renders some of the optimization proposed in pytorch#100024 invalid, though as result `torch.equal` will become much slower for identical floating point tensors. Add regression test that calls torch.equal for tensor containing NaN Fixes pytorch#111251 Pull Request resolved: pytorch#111699 Approved by: https://github.com/Skylion007, https://github.com/albanD

facebook-github-bot added the fb-exported label Apr 25, 2023

albanD reviewed Apr 25, 2023

View reviewed changes

houseroad force-pushed the export-D45282119 branch from cc2794e to 24306ba Compare April 25, 2023 23:28

houseroad force-pushed the export-D45282119 branch from 24306ba to 60fa585 Compare April 25, 2023 23:50

houseroad force-pushed the export-D45282119 branch from 60fa585 to 3119f66 Compare April 26, 2023 00:08

houseroad requested review from davidberard98 and ezyang April 26, 2023 16:30

houseroad force-pushed the export-D45282119 branch from 3119f66 to e7668cd Compare April 26, 2023 16:40

houseroad force-pushed the export-D45282119 branch from e7668cd to c93319b Compare April 26, 2023 16:50

ezyang reviewed Apr 26, 2023

View reviewed changes

ezyang approved these changes Apr 26, 2023

View reviewed changes

houseroad force-pushed the export-D45282119 branch from c93319b to 5973cf1 Compare April 26, 2023 20:48

houseroad force-pushed the export-D45282119 branch from 5973cf1 to 9c9595c Compare April 26, 2023 20:58

houseroad force-pushed the export-D45282119 branch from 9c9595c to ce625e6 Compare April 27, 2023 00:37

houseroad force-pushed the export-D45282119 branch from ce625e6 to 238234d Compare April 27, 2023 00:47

houseroad force-pushed the export-D45282119 branch from 238234d to 3e92a89 Compare April 27, 2023 00:57

facebook-github-bot merged commit d7fa7fa into pytorch:main Apr 28, 2023

huydhn mentioned this pull request Apr 30, 2023

DISABLED test_equal (__main__.TestTorch) #100340

Closed

This was referenced Oct 14, 2023

Version 2.1 breaks torch.equal when tensors contain nans #111251

Closed

Fix regression in torch.equal behavior for NaNs #111699

Closed

malfet mentioned this pull request Oct 25, 2023

[Release-2.1.1] Fix regression in torch.equal behavior for NaNs #111996

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce fast path in the CPU equal op #100024

Introduce fast path in the CPU equal op #100024

houseroad commented Apr 25, 2023

pytorch-bot bot commented Apr 25, 2023 •

edited

Loading

facebook-github-bot commented Apr 25, 2023

albanD Apr 25, 2023

houseroad Apr 26, 2023

facebook-github-bot commented Apr 25, 2023

facebook-github-bot commented Apr 25, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

ezyang Apr 26, 2023

houseroad Apr 26, 2023

ezyang Apr 26, 2023

ezyang Apr 26, 2023

ezyang left a comment

houseroad commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 27, 2023

facebook-github-bot commented Apr 27, 2023

facebook-github-bot commented Apr 27, 2023

Introduce fast path in the CPU equal op #100024

Introduce fast path in the CPU equal op #100024

Conversation

houseroad commented Apr 25, 2023

pytorch-bot bot commented Apr 25, 2023 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/100024

✅ No Failures

facebook-github-bot commented Apr 25, 2023

albanD Apr 25, 2023

Choose a reason for hiding this comment

houseroad Apr 26, 2023

Choose a reason for hiding this comment

facebook-github-bot commented Apr 25, 2023

facebook-github-bot commented Apr 25, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

ezyang Apr 26, 2023

Choose a reason for hiding this comment

houseroad Apr 26, 2023

Choose a reason for hiding this comment

ezyang Apr 26, 2023

Choose a reason for hiding this comment

ezyang Apr 26, 2023

Choose a reason for hiding this comment

ezyang left a comment

Choose a reason for hiding this comment

houseroad commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 26, 2023

facebook-github-bot commented Apr 27, 2023

facebook-github-bot commented Apr 27, 2023

facebook-github-bot commented Apr 27, 2023

pytorch-bot bot commented Apr 25, 2023 •

edited

Loading