Requirements to pass WGMMA LHS operand in registers #4785

Open
chsigg opened this issue Sep 23, 2024 · 3 comments

Comments

chsigg (Collaborator) commented Sep 23, 2024

NVIDIA is implementing an optimization to pass the LHS operand of WGMMA ops in registers. This allows element-wise prologues to pass their intermediate result directly to WGMMA instead of first writing it to shared memory, as is currently required.
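
For illustration, here is a minimal, hypothetical Triton kernel sketch (not taken from NVIDIA's changes) of the pattern this optimization targets: an element-wise prologue applied to the LHS tile right before the dot. Today the transformed tile is staged through shared memory before the WGMMA; with the optimization it could be consumed directly from registers. All names and the tiling are illustrative only.

```python
import triton
import triton.language as tl

# Hypothetical kernel; computes a single BLOCK_M x BLOCK_N tile for brevity.
# On Hopper, tl.dot lowers to WGMMA, and currently the element-wise result
# `a` is written back to shared memory before the MMA consumes it.
@triton.jit
def prologue_matmul(a_ptr, b_ptr, c_ptr, M, N, K,
                    BLOCK_M: tl.constexpr, BLOCK_N: tl.constexpr,
                    BLOCK_K: tl.constexpr):
    offs_m = tl.arange(0, BLOCK_M)
    offs_n = tl.arange(0, BLOCK_N)
    offs_k = tl.arange(0, BLOCK_K)
    acc = tl.zeros((BLOCK_M, BLOCK_N), dtype=tl.float32)
    for k in range(0, K, BLOCK_K):
        a = tl.load(a_ptr + offs_m[:, None] * K + (k + offs_k)[None, :])
        b = tl.load(b_ptr + (k + offs_k)[:, None] * N + offs_n[None, :])
        a = tl.exp(a)            # element-wise prologue on the LHS operand
        acc = tl.dot(a, b, acc)  # WGMMA could take `a` from registers here
    tl.store(c_ptr + offs_m[:, None] * N + offs_n[None, :], acc)
```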

The OpenXLA team is currently reviewing NVIDIA's changes with the intent of eventually writing a PR against this repository. We heard through @ThomasRaoux and @gflegar that the Triton team is planning a similar feature, so it would be great to align the requirements. @ThomasRaoux, how can we best achieve this? Are you far enough along in the planning phase that you could provide some feedback to @ggengnv? Or would you prefer for us to do a round of reviews first?

Jokeren (Contributor) commented Sep 23, 2024

> We heard through @ThomasRaoux and @gflegar that the Triton team is planning a similar feature

What feature are you referring to? I don't think anyone on our side is working on this.

ThomasRaoux (Collaborator) commented

@lezcano is currently working on a mixed-mode kernel that will require this support, but at this point there hasn't been much design work on the MMAv3-specific part yet, and there are a few steps before getting to that.

Looking at the changes in the link, it seems some more work is needed to productize it, so maybe we should indeed join efforts. What kind of timeline did you have in mind for this work?

ggengnv commented Sep 26, 2024

As an update, I've addressed existing PR comments and split the original PR into two for ease of review.

I'm currently on leave until 10/9 and will be happy to address feedback and resume work on this once I return :)
