JIT: Do greedy 4-opt for backward jumps in 3-opt layout #110277

amanasifkhalid · 2024-11-29T21:48:36Z

Part of #107749. Follow-up to #103450. Greedy 3-opt (i.e. an implementation that requires each move to be profitable on its own) is not well-suited for discovering profitable moves for backward jumps, as such movement requires an unrelated move to first place the source block lexically behind the destination block. Thus, the 3-opt implementation added in #103450 incorporates a 4-opt move for backward jumps, where we partition 1) before the destination block, 2) before the source block, and 3) directly after the source block. This 4-opt implementation can be expanded to search for the best cut point between the destination and source blocks to maximize its efficacy. Since we can compute the distance between the blocks, we can skip this linear search for large distances if it proves to be too expensive.

dotnet-policy-service · 2024-11-29T21:49:07Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

amanasifkhalid · 2024-12-02T05:07:40Z

cc @dotnet/jit-contrib, @AndyAyersMS PTAL. Diffs show this has more PerfScore improvements than regressions across platforms, for what it's worth. TP regressions seem to be inflated by some outlier in realworld, though I'm having a tough time narrowing it down without a working pin binary locally...

AndyAyersMS · 2024-12-03T17:59:26Z

I have a machine that can still run pin. If you want, I can try and find the problematic case or cases.

amanasifkhalid · 2024-12-03T18:55:16Z

I have a machine that can still run pin. If you want, I can try and find the problematic case or cases.

Thank you for the offer! I might be able to hunt it down manually -- I'll let you know how that goes.

amanasifkhalid · 2024-12-03T18:56:14Z

Also, the code looks uglier, but pre-computing part of the partition cost improved TP across the board by quite a bit.

amanasifkhalid · 2024-12-03T20:38:20Z

Thank you for the offer! I might be able to hunt it down manually -- I'll let you know how that goes.

The pathological case is the same as the one in #109521: We have a method with over a thousand basic blocks, and some blocks that are interesting to 3/4-opt have hundreds of predecessors. To compute the costs of potential cut points, we have to iterate up to every single predecessor edge into each block to the right of a cut point; with the previous implementation of this PR, this meant iterating up to 780 predecessor edges, dozens of times. With the new logic for precomputing some parts of the cost, the TP cost for this particular method drops from over 160% to 3.9%, hence why the TP diffs look far less dramatic overall.

AndyAyersMS

Large pred lists are a frequent source of trouble. Glad you were able to track down the problematic case.

amanasifkhalid · 2024-12-03T21:24:25Z

/ba-g Build analysis blocked by #110173

Part of dotnet#107749. Follow-up to dotnet#103450. Greedy 3-opt (i.e. an implementation that requires each move to be profitable on its own) is not well-suited for discovering profitable moves for backward jumps, as such movement requires an unrelated move to first place the source block lexically behind the destination block. Thus, the 3-opt implementation added in dotnet#103450 incorporates a 4-opt move for backward jumps, where we partition 1) before the destination block, 2) before the source block, and 3) directly after the source block. This 4-opt implementation can be expanded to search for the best cut point between the destination and source blocks to maximize its efficacy.

Follow-up to #110277. Fixes #110756. Don't consider 4-opt cut points that would move the entry block of a try/handler region below other blocks in the region. Previously, either future moves would put the entry block back at the top of the region, or we would get unlucky in the rare case and hit asserts.

amanasifkhalid added 2 commits November 29, 2024 16:07

Move partition cost computation into helper

7e906ae

Search for best cut point for backward jumps

539d547

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Nov 29, 2024

dotnet-policy-service bot assigned amanasifkhalid Nov 29, 2024

build-analysis bot mentioned this pull request Nov 30, 2024

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

3 tasks

am11 mentioned this pull request Dec 1, 2024

System.Formats.Nrbf.Tests time out on win-x86 #110289

Closed

build-analysis bot mentioned this pull request Dec 1, 2024

System.Formats.Nrbf.Tests timeouts #110285

Closed

amanasifkhalid marked this pull request as ready for review December 2, 2024 05:04

Pre-compute part of partition cost

8694e6b

build-analysis bot mentioned this pull request Dec 3, 2024

iOS tests failing with WORKLOAD TIMED OUT - Killing user command. #108103

Open

AndyAyersMS approved these changes Dec 3, 2024

View reviewed changes

amanasifkhalid merged commit 6d12a30 into dotnet:main Dec 3, 2024
102 of 108 checks passed

amanasifkhalid deleted the greedy-4-opt branch December 3, 2024 21:25

amanasifkhalid mentioned this pull request Dec 3, 2024

JIT: Flowgraph Modernization and Improved Block Layout in .NET 10 #107749

Open

35 tasks

LoopedBard3 mentioned this pull request Dec 10, 2024

[Perf] Linux/x64: 21 Regressions on 12/3/2024 10:12:58 PM #110581

Open

This was referenced Dec 12, 2024

[Perf] Linux/x64: 1 Regression on 11/18/2024 1:48:19 AM dotnet/perf-autofiling-issues#45035

Closed

[Perf] Windows/x64: 1 Regression on 12/9/2024 2:48:56 PM dotnet/perf-autofiling-issues#46608

Closed

amanasifkhalid mentioned this pull request Dec 23, 2024

JIT: Don't move EH region entries when aligning backward jumps #110918

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: Do greedy 4-opt for backward jumps in 3-opt layout #110277

JIT: Do greedy 4-opt for backward jumps in 3-opt layout #110277

amanasifkhalid commented Nov 29, 2024 •

edited

Loading

dotnet-policy-service bot commented Nov 29, 2024

amanasifkhalid commented Dec 2, 2024

AndyAyersMS commented Dec 3, 2024

amanasifkhalid commented Dec 3, 2024

amanasifkhalid commented Dec 3, 2024

amanasifkhalid commented Dec 3, 2024

AndyAyersMS left a comment

amanasifkhalid commented Dec 3, 2024

JIT: Do greedy 4-opt for backward jumps in 3-opt layout #110277

JIT: Do greedy 4-opt for backward jumps in 3-opt layout #110277

Conversation

amanasifkhalid commented Nov 29, 2024 • edited Loading

dotnet-policy-service bot commented Nov 29, 2024

amanasifkhalid commented Dec 2, 2024

AndyAyersMS commented Dec 3, 2024

amanasifkhalid commented Dec 3, 2024

amanasifkhalid commented Dec 3, 2024

amanasifkhalid commented Dec 3, 2024

AndyAyersMS left a comment

Choose a reason for hiding this comment

amanasifkhalid commented Dec 3, 2024

amanasifkhalid commented Nov 29, 2024 •

edited

Loading