
Fix TTIR to TTNN conversion for all gather #1182

Merged 1 commit into main on Dec 6, 2024

Conversation

gfengTT (Contributor) commented Nov 6, 2024

This fixes the incorrect conversion of the ttnn all_gather op: the conversion should not use the output of an EmptyOp as the input to all_gather; it should pass the actual input tensor instead, since otherwise the tensor being gathered never reaches the op.
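For readers unfamiliar with MLIR dialect conversion, a minimal sketch of the shape of such a fix follows, assuming standard MLIR dialect-conversion machinery. The class name, the accessor names (getInput, getDim), and the ttnn::AllGatherOp builder signature are illustrative assumptions rather than tt-mlir's actual code; the essential point is that the rewrite forwards the adaptor's converted input operand instead of materializing a ttnn::EmptyOp and using its result.

  // Sketch under stated assumptions: tt-mlir's real dialect headers, op
  // definitions, and builder signatures may differ from what is shown here.
  #include "mlir/Transforms/DialectConversion.h"
  // (Assumed tt-mlir dialect headers for ttir::AllGatherOp / ttnn::AllGatherOp.)

  using namespace mlir;

  namespace {
  // Converts ttir.all_gather into ttnn.all_gather.
  struct AllGatherOpConversionPattern
      : public OpConversionPattern<ttir::AllGatherOp> {
    using OpConversionPattern<ttir::AllGatherOp>::OpConversionPattern;

    LogicalResult
    matchAndRewrite(ttir::AllGatherOp op, OpAdaptor adaptor,
                    ConversionPatternRewriter &rewriter) const override {
      // The buggy pattern (paraphrased) created a ttnn::EmptyOp here and
      // passed its result as the all_gather input, so the tensor actually
      // being gathered was dropped on the floor.
      //
      // Fixed: forward the converted input operand directly. Remaining
      // attributes (e.g. num_links) are elided for brevity.
      rewriter.replaceOpWithNewOp<ttnn::AllGatherOp>(
          op, getTypeConverter()->convertType(op.getType()),
          /*input=*/adaptor.getInput(), // the real tensor, not an empty buffer
          /*dim=*/adaptor.getDim());
      return success();
    }
  };
  } // namespace

The effect is visible in the IR dumps below: the ttnn.empty and its dealloc disappear, and all_gather consumes the to_layout result directly.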

Testing with:

./build/bin/ttmlir-opt --ttir-to-ttnn-backend-pipeline test/ttmlir/Dialect/TTNN/ccl/all_gather.mlir

Before (all_gather consumes %3, the result of ttnn.empty, while the actual input %2 is deallocated immediately after it is produced):

  func.func @forward(%arg0: tensor<1x1x32x32xbf16, #layout>) -> tensor<1x1x32x128xbf16, #layout1> {
    %0 = "ttnn.get_device"() <{mesh_shape = #ttnn<mesh_shape 1x1>}> : () -> !tt.device<#device>
    %1 = "ttnn.to_device"(%arg0, %0) <{memory_config = #ttnn.memory_config<<interleaved>, <dram>, <<1x1>>>}> : (tensor<1x1x32x32xbf16, #layout>, !tt.device<#device>) -> tensor<1x1x32x32xbf16, #layout2>
    %2 = "ttnn.to_layout"(%1) <{layout = #ttnn.layout<tile>}> : (tensor<1x1x32x32xbf16, #layout2>) -> tensor<1x1x32x32xbf16, #layout2>
    "ttnn.dealloc"(%2) : (tensor<1x1x32x32xbf16, #layout2>) -> ()
    "ttnn.dealloc"(%1) : (tensor<1x1x32x32xbf16, #layout2>) -> ()
    %3 = "ttnn.empty"(%0) <{dtype = #tt.supportedDataTypes<bf16>, layout = #ttnn.layout<tile>, memory_config = #ttnn.memory_config<<interleaved>, <dram>, <<1x1>>>, shape = #ttnn.shape<1x1x32x32>}> : (!tt.device<#device>) -> tensor<1x1x32x32xbf16, #layout2>
    %4 = "ttnn.all_gather"(%3) <{dim = 3 : si32, num_links = 1 : si32}> : (tensor<1x1x32x32xbf16, #layout2>) -> tensor<1x1x32x128xbf16, #layout3>
    "ttnn.dealloc"(%3) : (tensor<1x1x32x32xbf16, #layout2>) -> ()
    %5 = "ttnn.from_device"(%4) : (tensor<1x1x32x128xbf16, #layout3>) -> tensor<1x1x32x128xbf16, #layout1>
    "ttnn.dealloc"(%4) : (tensor<1x1x32x128xbf16, #layout3>) -> ()
    %6 = "ttnn.to_layout"(%5) <{layout = #ttnn.layout<row_major>}> : (tensor<1x1x32x128xbf16, #layout1>) -> tensor<1x1x32x128xbf16, #layout1>
    "ttnn.dealloc"(%5) : (tensor<1x1x32x128xbf16, #layout1>) -> ()
    return %6 : tensor<1x1x32x128xbf16, #layout1>
  }

After:

  func.func @forward(%arg0: tensor<1x1x32x32xbf16, #layout>) -> tensor<1x1x32x128xbf16, #layout1> {
    %0 = "ttnn.get_device"() <{mesh_shape = #ttnn<mesh_shape 1x1>}> : () -> !tt.device<#device>
    %1 = "ttnn.to_device"(%arg0, %0) <{memory_config = #ttnn.memory_config<<interleaved>, <dram>, <<1x1>>>}> : (tensor<1x1x32x32xbf16, #layout>, !tt.device<#device>) -> tensor<1x1x32x32xbf16, #layout2>
    %2 = "ttnn.to_layout"(%1) <{layout = #ttnn.layout<tile>}> : (tensor<1x1x32x32xbf16, #layout2>) -> tensor<1x1x32x32xbf16, #layout2>
    "ttnn.dealloc"(%1) : (tensor<1x1x32x32xbf16, #layout2>) -> ()
    %3 = "ttnn.all_gather"(%2) <{dim = 3 : si32, num_links = 1 : si32}> : (tensor<1x1x32x32xbf16, #layout2>) -> tensor<1x1x32x128xbf16, #layout3>
    "ttnn.dealloc"(%2) : (tensor<1x1x32x32xbf16, #layout2>) -> ()
    %4 = "ttnn.from_device"(%3) : (tensor<1x1x32x128xbf16, #layout3>) -> tensor<1x1x32x128xbf16, #layout1>
    "ttnn.dealloc"(%3) : (tensor<1x1x32x128xbf16, #layout3>) -> ()
    %5 = "ttnn.to_layout"(%4) <{layout = #ttnn.layout<row_major>}> : (tensor<1x1x32x128xbf16, #layout1>) -> tensor<1x1x32x128xbf16, #layout1>
    "ttnn.dealloc"(%4) : (tensor<1x1x32x128xbf16, #layout1>) -> ()
    return %5 : tensor<1x1x32x128xbf16, #layout1>
  }

nsmithtt (Contributor) commented Nov 7, 2024

Hold off on landing this; I need to understand this better. I think we probably want to keep all gather DPS.
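(DPS here is destination-passing style: the convention in which an op is handed a pre-allocated output tensor as an operand rather than allocating its own result. That convention is presumably what produced the ttnn.empty in the original lowering; the merged fix stops feeding that empty buffer to all_gather as its input, as the "After" IR above shows.)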

nsmithtt (Contributor) left a comment


nvm, OK looks good!

gfengTT force-pushed the gfeng/fix-ttnn-conversion-of-all-gather branch from 0c10623 to b158d23 on December 5, 2024, 20:16
gfengTT marked this pull request as ready for review on December 5, 2024, 20:16
gfengTT force-pushed the gfeng/fix-ttnn-conversion-of-all-gather branch from b158d23 to 2a61f70 on December 6, 2024, 14:51
gfengTT enabled auto-merge (squash) on December 6, 2024, 14:51
gfengTT merged commit e052bae into main on December 6, 2024
20 checks passed
gfengTT deleted the gfeng/fix-ttnn-conversion-of-all-gather branch on December 6, 2024, 16:13
nsmithtt (Contributor) commented Dec 9, 2024

FYI @wooseokTT
