
Properly lower add and mul #6731

Merged
merged 5 commits into from
Mar 13, 2024
Conversation

wonjoolee95 (Collaborator)

Partly fixes #6589

@wonjoolee95 wonjoolee95 requested a review from bhavya01 — March 13, 2024 01:00
@wonjoolee95 wonjoolee95 force-pushed the wonjoo/lower-ops-properly branch from 9d14293 to 997c79a — March 13, 2024 01:06
torch_xla/csrc/elementwise.h (review comments: outdated, resolved)
wonjoolee95 (Collaborator, Author) commented Mar 13, 2024

I noticed that one of the SPMD unit tests that checks HLOs failed previously; it seems the HLOs changed slightly because this PR updates the lowering logic. In this simple SPMD unit test, the HLO without this change is:

ENTRY %IrToHlo.14 (p0.4: f32[1,128], p1.5: f32[1,128]) -> (f32[1,128]) {
  %p1.5 = f32[1,128]{1,0} parameter(1), sharding={replicated}
  %p0.4 = f32[1,128]{1,0} parameter(0), sharding={replicated}
  %constant.3 = f32[] constant(1)
  %broadcast.6 = f32[1,128]{1,0} broadcast(f32[] %constant.3), dimensions={}
  %multiply.7 = f32[1,128]{1,0} multiply(f32[1,128]{1,0} %p0.4, f32[1,128]{1,0} %broadcast.6)
  %add.8 = f32[1,128]{1,0} add(f32[1,128]{1,0} %p1.5, f32[1,128]{1,0} %multiply.7)
  %custom-call.9 = f32[1,128]{1,0} custom-call(f32[1,128]{1,0} %add.8), custom_call_target="Sharding", sharding={replicated}
  %constant.2 = f32[] constant(0)
  %constant.1 = f32[] constant(1)
  %multiply.10 = f32[] multiply(f32[] %constant.2, f32[] %constant.1)
  %broadcast.11 = f32[1,128]{1,0} broadcast(f32[] %multiply.10), dimensions={}
  %add.12 = f32[1,128]{1,0} add(f32[1,128]{1,0} %custom-call.9, f32[1,128]{1,0} %broadcast.11)
  ROOT %tuple.13 = (f32[1,128]{1,0}) tuple(f32[1,128]{1,0} %add.12)
}

And the updated HLO looks like:

ENTRY %IrToHlo.14 (p0.5: f32[1,128], p1.8: f32[1,128]) -> (f32[1,128]) {
  %p1.8 = f32[1,128]{1,0} parameter(1), sharding={replicated}
  %p0.5 = f32[1,128]{1,0} parameter(0), sharding={replicated}
  %constant.4 = f32[] constant(1)
  %broadcast.6 = f32[1,128]{1,0} broadcast(f32[] %constant.4), dimensions={}
  %multiply.7 = f32[1,128]{1,0} multiply(f32[1,128]{1,0} %p0.5, f32[1,128]{1,0} %broadcast.6)
  %add.9 = f32[1,128]{1,0} add(f32[1,128]{1,0} %p1.8, f32[1,128]{1,0} %multiply.7)
  %custom-call.10 = f32[1,128]{1,0} custom-call(f32[1,128]{1,0} %add.9), custom_call_target="Sharding", sharding={replicated}
  %constant.2 = f32[] constant(0)
  %constant.1 = f32[] constant(1)
  %multiply.3 = f32[] multiply(f32[] %constant.2, f32[] %constant.1)
  %broadcast.11 = f32[1,128]{1,0} broadcast(f32[] %multiply.3), dimensions={}
  %add.12 = f32[1,128]{1,0} add(f32[1,128]{1,0} %custom-call.10, f32[1,128]{1,0} %broadcast.11)
  ROOT %tuple.13 = (f32[1,128]{1,0}) tuple(f32[1,128]{1,0} %add.12)
}

The contents of the HLOs are the same; the only difference is the numeric suffixes, e.g. %custom-call.9 -> %custom-call.10.

Synced offline with @yeounoh; this is fine, since only the instruction numbering in the HLO has changed.
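A quick way to check that two dumps match up to numbering is to strip the auto-assigned numeric suffixes from the instruction names before comparing. A minimal sketch (normalize_hlo is a hypothetical helper, not part of the test suite):

```python
import re

def normalize_hlo(hlo_text: str) -> str:
    """Strip auto-assigned numeric suffixes from HLO instruction names
    (e.g. %custom-call.9 -> %custom-call) so two dumps can be compared
    structurally, ignoring instruction numbering."""
    return re.sub(r"(%[\w-]+?)\.\d+", r"\1", hlo_text)

# Lines from the two dumps above that differ only in numbering:
before = "%add.8 = f32[1,128]{1,0} add(f32[1,128]{1,0} %p1.5, f32[1,128]{1,0} %multiply.7)"
after = "%add.9 = f32[1,128]{1,0} add(f32[1,128]{1,0} %p1.8, f32[1,128]{1,0} %multiply.7)"

assert normalize_hlo(before) == normalize_hlo(after)
```

This only papers over renumbering; a real structural diff would also have to tolerate reordered instructions.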

@bhavya01 bhavya01 self-requested a review March 13, 2024 20:27
@wonjoolee95 wonjoolee95 force-pushed the wonjoo/lower-ops-properly branch from a24d373 to 1306408 — March 13, 2024 21:52
wonjoolee95 (Collaborator, Author)

Previously CI was all green; the most recent commit just rebases onto master. I'll merge this now to make the rc1 branch cut.

@wonjoolee95 wonjoolee95 merged commit fe3f23c into master Mar 13, 2024
2 of 3 checks passed
lsy323 pushed a commit that referenced this pull request Mar 13, 2024
lsy323 added a commit that referenced this pull request Mar 13, 2024
Co-authored-by: Wonjoo Lee <wonjoo@google.com>
Successfully merging this pull request may close these issues.

Update IR-level lowerings to proper lowerings