Experiment: Only track fingerprints for queries with reconstructible dep-nodes. #118667

michaelwoerister · 2023-12-06T09:50:42Z

This is an experiment to collect performance data about alternative ways to adapt #109050. The PR makes the following change:

All queries with keys that are not reconstructible from their corresponding DepNode are now treated similar to anonymous queries. That is we don't compute a DepNode or result fingerprint for them.

This has some implications:

We save time because query keys and results don't have to be hashed.
We can save space storing less data for these nodes in the on-disk dep-graph. (not implemented in this PR as I ran out of time. Maybe this would be a quick fix for @saethlin though?)
We don't have to worry about hash collisions for DepNode in these cases (although we still have to worry about hash collisions for result fingerprints, which might include all the same HashStable impls)
Same as with anonymous queries, the graph can grow additional nodes and edges in some situations because existing graph parts might be promoted while new parts are allocated for the same query if it is re-executed. I don't know how much this happens in practice.
We cannot cache query results for queries with complex keys.

Given that that last point affects some heavy queries, I have my doubts that this strategy is a win. But let's run it through perf at least once.

cc @cjgillot, @Zoxc

r? @ghost

michaelwoerister · 2023-12-06T10:58:15Z

@bors try @rust-timer queue

bors · 2023-12-06T11:00:36Z

⌛ Trying commit f2fd56c with merge 139a4ac...

…ingerprints, r=<try> Experiment: Only track fingerprints for queries with reconstructible dep-nodes. This is an experiment to collect performance data about alternative ways to adapt rust-lang#109050. The PR makes the following change: All queries with keys that are not reconstructible from their corresponding DepNode are now treated similar to anonymous queries. That is we don't compute a DepNode or result fingerprint for them. This has some implications: - We save time because query keys and results don't have to be hashed. - We can save space storing less data for these nodes in the on-disk dep-graph. (not implemented in this PR as I ran out of time. Maybe this would be a quick fix for `@saethlin` though?) - We don't have to worry about hash collisions for DepNode in these cases (although we still have to worry about hash collisions for result fingerprints, which might include all the same HashStable impls) - Same as with anonymous queries, the graph can grow additional nodes and edges in some situations because existing graph parts might be promoted while new parts are allocated for the same query if it is re-executed. I don't know how much this happens in practice. - We cannot cache query results for queries with complex keys. Given that that last point affects some heavy queries, I have my doubts that this strategy is a win. But let's run it through perf at least once. cc `@cjgillot,` `@Zoxc` r? `@ghost`

bors · 2023-12-06T12:27:23Z

☀️ Try build successful - checks-actions
Build commit: 139a4ac (139a4acb26220019f321f5893932e9486dbbb47d)

bjorn3 · 2023-12-06T14:42:27Z

We cannot cache query results for queries with complex keys.

Does that also mean the CompileCodegenUnit and CompileMonoItem dep nodes are no longer tracked? Or does this not apply to dep nodes that don't have an explicit query, but are using the tcx.dep_graph.with_task() api? cg_clif passes a quite complex type as argument for the CompileCodegenUnit dep node ((BackendConfig, Arc<GlobalAsmConfig>, Symbol, ConcurrencyLimiterToken)).

rust-timer · 2023-12-06T16:22:21Z

Finished benchmarking commit (139a4ac): comparison URL.

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	112.8%	[0.2%, 1907.4%]	112
Regressions ❌ (secondary)	153.2%	[0.3%, 3197.3%]	49
Improvements ✅ (primary)	-1.5%	[-3.6%, -0.3%]	44
Improvements ✅ (secondary)	-1.9%	[-6.1%, -0.2%]	64
All ❌✅ (primary)	80.6%	[-3.6%, 1907.4%]	156

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	16.2%	[1.9%, 143.9%]	62
Regressions ❌ (secondary)	8.4%	[1.4%, 44.6%]	18
Improvements ✅ (primary)	-4.8%	[-21.1%, -0.6%]	55
Improvements ✅ (secondary)	-9.0%	[-38.7%, -2.1%]	27
All ❌✅ (primary)	6.3%	[-21.1%, 143.9%]	117

Cycles

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

	mean	range	count
Regressions ❌ (primary)	193.3%	[2.8%, 1816.3%]	64
Regressions ❌ (secondary)	242.1%	[6.6%, 2938.2%]	29
Improvements ✅ (primary)	-3.5%	[-8.7%, -1.7%]	38
Improvements ✅ (secondary)	-5.3%	[-10.6%, -2.0%]	28
All ❌✅ (primary)	120.0%	[-8.7%, 1816.3%]	102

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 675.083s -> 673.668s (-0.21%)
Artifact size: 314.18 MiB -> 314.02 MiB (-0.05%)

bjorn3 · 2023-12-06T16:25:11Z

Does that also mean the CompileCodegenUnit and CompileMonoItem dep nodes are no longer tracked?

I think the perf results show that this is indeed the case.

michaelwoerister · 2023-12-06T16:54:12Z

Does that also mean the CompileCodegenUnit and CompileMonoItem dep nodes are no longer tracked?

Well, I actually wanted to say that these are not affected directly 🙂 Anything that explicitly uses DepGraph::with_task works the same as before. I still think that's true (modulo any bugs I introduced).

What I suspect is happening is that we cannot mark some crucial queries green after re-evaluation anymore. The hypothesis is that, before, a query like symbol_name was "maybe changed" (because one of its inputs was red) but then re-evaluating it yielded the same result as in the previous session, so it would be marked green after all. But now, that query instance cannot be correlated to the previous instance anymore, so the system assumes that it has changed.

This could be solved by making more query keys reconstructible, but that's complicated and I'm not sure it would be worth the trouble.

This comment has been minimized.

Sign in to view

Experiment: Only track fingerprints for reconstructible dep-nodes.

f2fd56c

michaelwoerister force-pushed the experiment-sparse-fingerprints branch from 57ca354 to f2fd56c Compare December 6, 2023 10:20

This comment has been minimized.

Sign in to view

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Dec 6, 2023

This comment has been minimized.

Sign in to view

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Dec 6, 2023

michaelwoerister closed this Dec 6, 2023

michaelwoerister mentioned this pull request Dec 7, 2023

Only use the new node hashmap for anonymous nodes. #112469

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experiment: Only track fingerprints for queries with reconstructible dep-nodes. #118667

Experiment: Only track fingerprints for queries with reconstructible dep-nodes. #118667

michaelwoerister commented Dec 6, 2023

This comment has been minimized.

michaelwoerister commented Dec 6, 2023

This comment has been minimized.

bors commented Dec 6, 2023

bors commented Dec 6, 2023

This comment has been minimized.

bjorn3 commented Dec 6, 2023 •

edited

Loading

rust-timer commented Dec 6, 2023

bjorn3 commented Dec 6, 2023

michaelwoerister commented Dec 6, 2023

Experiment: Only track fingerprints for queries with reconstructible dep-nodes. #118667

Experiment: Only track fingerprints for queries with reconstructible dep-nodes. #118667

Conversation

michaelwoerister commented Dec 6, 2023

This comment has been minimized.

michaelwoerister commented Dec 6, 2023

This comment has been minimized.

bors commented Dec 6, 2023

bors commented Dec 6, 2023

This comment has been minimized.

bjorn3 commented Dec 6, 2023 • edited Loading

rust-timer commented Dec 6, 2023

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Instruction count

Max RSS (memory usage)

Cycles

Binary size

bjorn3 commented Dec 6, 2023

michaelwoerister commented Dec 6, 2023

bjorn3 commented Dec 6, 2023 •

edited

Loading