Enable MIR inlining for generators too #99782

cjgillot · 2022-07-26T20:47:14Z

This is an attempt to enable MIR inlining on generators too.

This PR proceeds in 2 steps:

the generator information is separated from the MIR body itself, to be computed using two separate queries: mir_generator_lowered and mir_generator_info;
we perform inlining on the generator body (generator_resume function).

This PR also takes the opportunity to simplify the generated generator_drop code, since it won't be optimized on its own.
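
To make the second step concrete, here is a minimal sketch that is not taken from the PR. It assumes an async fn as the stable-Rust stand-in for a generator (at the time of this PR, async functions were lowered to generators). The names `add_one`, `compute`, `NoopWake`, and `poll_once` are hypothetical. The call to `add_one` sits inside the body that gets resumed on each poll, which is the kind of call site the MIR inliner would be allowed to consider under this PR; whether it is actually inlined depends on the inliner's heuristics.

```rust
use std::future::Future;
use std::pin::pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

// A waker that does nothing; enough to poll a future that never suspends.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

// Small callee (hypothetical): a typical inlining candidate.
fn add_one(x: u32) -> u32 {
    x + 1
}

// The body of this async fn becomes the state machine's resume function.
// The call to `add_one` below is a call site inside that body.
async fn compute(x: u32) -> u32 {
    add_one(x) * 2
}

// Poll a future exactly once with the no-op waker.
fn poll_once<F: Future>(fut: F) -> Poll<F::Output> {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut fut = pin!(fut);
    fut.as_mut().poll(&mut cx)
}

fn main() {
    // `compute` has no await points, so a single poll completes it.
    assert_eq!(poll_once(compute(20)), Poll::Ready(42));
}
```

Inlining does not change the observable result here; it only affects the MIR that later passes and codegen see for the resume function.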

rustbot · 2022-07-26T20:47:17Z

Some changes occurred to MIR optimizations

cc @rust-lang/wg-mir-opt

rust-highfive · 2022-07-26T20:47:18Z

r? @fee1-dead

(rust-highfive has picked a reviewer for you, use r? to override)

cjgillot · 2022-07-26T21:07:17Z

@bors try @rust-timer queue

rust-timer · 2022-07-26T21:07:19Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2022-07-26T21:07:25Z

⌛ Trying commit a4a8252e576a22eae530dc638bf98102f566f4d8 with merge ac9b18299e23be0ac59586be05a93e8e9e0f54d8...

bors · 2022-07-26T23:35:27Z

☀️ Try build successful - checks-actions
Build commit: ac9b18299e23be0ac59586be05a93e8e9e0f54d8

rust-timer · 2022-07-26T23:35:29Z

Queued ac9b18299e23be0ac59586be05a93e8e9e0f54d8 with parent c11207e, future comparison URL.

rust-timer · 2022-07-27T00:54:54Z

Finished benchmarking commit (ac9b18299e23be0ac59586be05a93e8e9e0f54d8): comparison url.

Instruction count

Primary benchmarks: 😿 relevant regressions found
Secondary benchmarks: mixed results

	mean¹	max	count²
Regressions 😿 (primary)	0.5%	0.9%	41
Regressions 😿 (secondary)	0.9%	4.1%	14
Improvements 🎉 (primary)	N/A	N/A	0
Improvements 🎉 (secondary)	-1.7%	-2.1%	5
All 😿🎉 (primary)	0.5%	0.9%	41

Max RSS (memory usage)


Primary benchmarks: mixed results
Secondary benchmarks: mixed results

	mean¹	max	count²
Regressions 😿 (primary)	1.8%	2.2%	4
Regressions 😿 (secondary)	2.9%	3.1%	2
Improvements 🎉 (primary)	-1.8%	-2.2%	10
Improvements 🎉 (secondary)	-3.3%	-5.9%	4
All 😿🎉 (primary)	-0.8%	-2.2%	14

Cycles


Primary benchmarks: no relevant changes found
Secondary benchmarks: mixed results

	mean¹	max	count²
Regressions 😿 (primary)	N/A	N/A	0
Regressions 😿 (secondary)	5.3%	5.3%	1
Improvements 🎉 (primary)	N/A	N/A	0
Improvements 🎉 (secondary)	-3.1%	-3.9%	5
All 😿🎉 (primary)	N/A	N/A	0

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.
Warning ⚠: The following benchmark(s) failed to build:

deeply-nested-multi

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf +perf-regression

¹ the arithmetic mean of the percent change
² number of relevant changes

fee1-dead · 2022-07-27T04:25:27Z

r? rust-lang/compiler

cjgillot · 2022-07-27T17:27:25Z

The deeply-nested-multi benchmark failure is only a recursion limit issue.

bors · 2022-07-28T19:58:50Z

☔ The latest upstream changes (presumably #99780) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2022-08-09T14:31:46Z

☔ The latest upstream changes (presumably #100089) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2022-08-22T18:27:53Z

☔ The latest upstream changes (presumably #99908) made this pull request unmergeable. Please resolve the merge conflicts.

bors · 2022-08-25T02:53:07Z

☔ The latest upstream changes (presumably #99946) made this pull request unmergeable. Please resolve the merge conflicts.

cjgillot · 2022-08-27T09:25:44Z

@bors try @rust-timer queue

rust-timer · 2022-08-27T09:25:45Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2022-08-27T09:25:52Z

⌛ Trying commit 24722aac770190c699585342078635d9e478696b with merge f33592845aa45f6bdcf5e797c6209813fe179704...

cjgillot · 2022-08-27T09:31:20Z

@bors try @rust-timer queue

rust-timer · 2022-08-27T09:31:21Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2022-08-27T09:31:27Z

⌛ Trying commit 9b0c4be7988c3d7906f99e86af021fd9fe6199dd with merge 22ba33181fb0c3e6588779a40a43dae8865e3993...

bors · 2022-08-27T11:21:51Z

☀️ Try build successful - checks-actions
Build commit: 22ba33181fb0c3e6588779a40a43dae8865e3993

rust-timer · 2022-08-27T11:21:53Z

Queued 22ba33181fb0c3e6588779a40a43dae8865e3993 with parent d0e1491, future comparison URL.

rust-timer · 2022-08-28T11:42:10Z

Finished benchmarking commit (22ba33181fb0c3e6588779a40a43dae8865e3993): comparison URL.

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf +perf-regression

Warning ⚠: The following benchmark(s) failed to build:

deeply-nested-multi

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean¹	range	count²
Regressions ❌ (primary)	0.5%	[0.2%, 0.9%]	57
Regressions ❌ (secondary)	0.9%	[0.2%, 4.1%]	22
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-2.0%	[-2.5%, -1.3%]	5
All ❌✅ (primary)	0.5%	[0.2%, 0.9%]	57

Max RSS (memory usage)


This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean¹	range	count²
Regressions ❌ (primary)	1.8%	[0.5%, 2.5%]	5
Regressions ❌ (secondary)	2.7%	[1.4%, 5.0%]	7
Improvements ✅ (primary)	-3.6%	[-4.1%, -3.0%]	2
Improvements ✅ (secondary)	-2.2%	[-2.8%, -1.7%]	3
All ❌✅ (primary)	0.2%	[-4.1%, 2.5%]	7

Cycles


This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean¹	range	count²
Regressions ❌ (primary)	2.0%	[2.0%, 2.0%]	1
Regressions ❌ (secondary)	3.3%	[3.3%, 3.3%]	1
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-	-	0
All ❌✅ (primary)	2.0%	[2.0%, 2.0%]	1

¹ the arithmetic mean of the percent change
² number of relevant changes
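
As a reading aid for the tables in the two perf reports, the sketch below recomputes the "All (primary)" mean for the Max RSS tables from the regression and improvement rows. It assumes that the "All" row is the arithmetic mean over the union of both groups, so it equals the count-weighted mean of the two row means. `combined_mean` is a hypothetical helper, not part of rustc-perf, and the inputs are the rounded figures printed in the tables.

```rust
// Count-weighted mean of per-group means: each entry is (mean percent change, count).
fn combined_mean(groups: &[(f64, u32)]) -> f64 {
    let total: u32 = groups.iter().map(|&(_, n)| n).sum();
    let weighted: f64 = groups.iter().map(|&(m, n)| m * f64::from(n)).sum();
    weighted / f64::from(total)
}

fn main() {
    // First run, Max RSS, primary: regressions 1.8% x 4, improvements -1.8% x 10.
    let first = combined_mean(&[(1.8, 4), (-1.8, 10)]);
    // Second run, Max RSS, primary: regressions 1.8% x 5, improvements -3.6% x 2.
    let second = combined_mean(&[(1.8, 5), (-3.6, 2)]);
    println!("first run:  {first:.2}%");
    println!("second run: {second:.2}%");
}
```

For the first run this gives about -0.77%, in line with the reported -0.8%. For the second run it gives about 0.26% against a reported 0.2%; the gap is presumably due to the table inputs already being rounded to one decimal place.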

rustbot · 2022-09-07T22:27:35Z

This PR changes MIR

cc @oli-obk, @RalfJung, @JakobDegen, @davidtwco, @celinval, @vakaras

bors · 2022-09-13T21:33:38Z

☔ The latest upstream changes (presumably #101086) made this pull request unmergeable. Please resolve the merge conflicts.

cjgillot · 2022-09-22T17:21:59Z

From perf results, this does not seem to be worth it.

JakobDegen · 2022-09-22T17:51:52Z

I agree that we should maybe not be turning this on right now, but I'd also caution against just using perf results here. Compile time perf is not the same as runtime perf, and also in general this will enable other optimizations that we may take advantage of in the future. I expect that we will want to revive this PR at some point.

wesleywiser · 2022-09-23T17:17:10Z

Agreed, I think there might be code quality reasons to enable this in the future. I do wonder though if inlining into generators is what we want. It seems plausible to me that keeping the generator code as small as possible might enable better inlining of the generator into its callsite which could also be beneficial to runtime performance.

JakobDegen · 2022-09-23T19:54:17Z

Hmm, what makes you think that generators are more affected by this than any other function?

rustbot added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Jul 26, 2022

rust-highfive assigned fee1-dead Jul 26, 2022

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jul 26, 2022

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jul 26, 2022

cjgillot mentioned this pull request Jul 26, 2022

Bug: Very inefficient code generated for async functions setup (and likely for generators in general) #99504

Open

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Jul 27, 2022

rust-highfive assigned wesleywiser and unassigned fee1-dead Jul 27, 2022

cjgillot force-pushed the inline-generator branch 2 times, most recently from 457f7df to 10c9543 Compare July 27, 2022 19:48

cjgillot force-pushed the inline-generator branch from 10c9543 to 6ff150c Compare July 28, 2022 20:47

cjgillot force-pushed the inline-generator branch from 6ff150c to 1a47e5d Compare August 9, 2022 19:46

cjgillot force-pushed the inline-generator branch from 1a47e5d to c75d32c Compare August 22, 2022 22:13

cjgillot force-pushed the inline-generator branch from c75d32c to 24722aa Compare August 27, 2022 09:25

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Aug 27, 2022

cjgillot force-pushed the inline-generator branch from 24722aa to 9b0c4be Compare August 27, 2022 09:27

rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Aug 28, 2022

cjgillot added 3 commits September 8, 2022 00:27

Separate generator info from MIR body.

75121fc

Simplify generator_drop since it won't be optimized.

0369ff2

Enable MIR inlining on generators.

3128b8f

cjgillot force-pushed the inline-generator branch from 9b0c4be to 3128b8f Compare September 7, 2022 22:27

cjgillot mentioned this pull request Sep 7, 2022

Separate generator info from MIR body. #101547

Closed

cjgillot closed this Sep 22, 2022

cjgillot deleted the inline-generator branch October 1, 2022 17:48
