hygiene: Ensure uniqueness of `SyntaxContextData`s #130324

petrochenkov · 2024-09-13T20:49:40Z

`SyntaxContextData`s are basically interned, with `SyntaxContext`s working as indices, so they are supposed to be unique.
However, duplicate `SyntaxContextData`s can currently be created during decoding from metadata or the incremental cache.
This PR fixes that.

cc #129827 (comment)
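
For readers unfamiliar with the invariant described above, here is a minimal sketch of an interning table, not the compiler's actual code. `Interner`, `CtxtData`, `CtxtId` and both methods are hypothetical names; `CtxtData` stands in for `SyntaxContextData` and `CtxtId` for `SyntaxContext`. It shows how a lookup-before-insert keeps entries unique, and how an append-only path (roughly what a careless decoder would do) produces a duplicate.

```rust
use std::collections::HashMap;

/// Hypothetical, trimmed-down stand-in for `SyntaxContextData`.
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
struct CtxtData {
    outer_expn: u32,
    transparency: u8,
    parent: u32,
}

/// Hypothetical stand-in for `SyntaxContext`: an index into the table.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
struct CtxtId(u32);

#[derive(Default)]
struct Interner {
    data: Vec<CtxtData>,
    index: HashMap<CtxtData, CtxtId>,
}

impl Interner {
    /// Return the existing id for `d` if it is already present,
    /// otherwise append it. Equal data always maps to the same id.
    fn intern(&mut self, d: CtxtData) -> CtxtId {
        if let Some(&id) = self.index.get(&d) {
            return id;
        }
        let id = CtxtId(self.data.len() as u32);
        self.data.push(d.clone());
        self.index.insert(d, id);
        id
    }

    /// Append without checking for an existing entry. This is the kind of
    /// path that can introduce duplicates.
    fn push_unchecked(&mut self, d: CtxtData) -> CtxtId {
        let id = CtxtId(self.data.len() as u32);
        self.data.push(d);
        id
    }
}
```

With `intern`, two equal `CtxtData` values share one id; with `push_unchecked`, the second copy gets a new id, so comparing ids no longer tells you whether the underlying data is equal.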

rustbot · 2024-09-13T20:49:47Z

r? @TaKO8Ki

rustbot has assigned @TaKO8Ki.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

petrochenkov · 2024-09-13T20:51:58Z

I've added many asserts; I'll change them to debug asserts if they affect performance.
@bors try @rust-timer queue


bors · 2024-09-13T20:53:08Z

⌛ Trying commit e577b7a with merge b517457...

bors · 2024-09-13T22:45:30Z

☀️ Try build successful - checks-actions
Build commit: b517457 (b51745778d3c14275d7b8f9115c2aa8e3b760bfb)

rust-timer · 2024-09-14T00:38:55Z

Finished benchmarking commit (b517457): comparison URL.

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 0.3% | [0.2%, 0.4%] | 11 |
| Regressions ❌ (secondary) | 0.3% | [0.1%, 1.1%] | 38 |
| Improvements ✅ (primary) | -0.3% | [-0.4%, -0.2%] | 10 |
| Improvements ✅ (secondary) | -0.4% | [-0.4%, -0.3%] | 3 |
| All ❌✅ (primary) | 0.0% | [-0.4%, 0.4%] | 21 |

Max RSS (memory usage)

Results (secondary 1.0%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | 3.7% | [3.7%, 3.7%] | 1 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -1.6% | [-1.6%, -1.6%] | 1 |
| All ❌✅ (primary) | - | - | 0 |

Cycles

Results (secondary -12.6%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -12.6% | [-15.5%, -2.5%] | 7 |
| All ❌✅ (primary) | - | - | 0 |

Binary size

Results (primary -0.5%, secondary -0.8%)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | -0.5% | [-2.0%, -0.0%] | 63 |
| Improvements ✅ (secondary) | -0.8% | [-2.6%, -0.0%] | 14 |
| All ❌✅ (primary) | -0.5% | [-2.0%, -0.0%] | 63 |

Bootstrap: 756.444s -> 757.208s (0.10%)
Artifact size: 341.13 MiB -> 341.19 MiB (0.02%)

cjgillot · 2024-09-14T01:52:20Z

I'm not super fond of the "hopefully" rhetoric...
`SyntaxContext`s form a tree structure; is there a way we could exploit it?
For example, by refactoring all this into a DFS in metadata/cache that fetches the root and then decodes the children in the proper order?
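
To make the suggestion concrete, here is a rough sketch of what "decode the children in the proper order" could mean, assuming the serialized contexts really do form a tree via their parent links. `decode_order` and the `parent_of` map are hypothetical; the function only computes an order in which each parent is visited before its children, it does not decode anything.

```rust
use std::collections::{HashMap, HashSet};

/// Return the ids reachable from `root` in DFS pre-order, so that every
/// parent appears before its children. `parent_of` maps child id -> parent id.
fn decode_order(root: u32, parent_of: &HashMap<u32, u32>) -> Vec<u32> {
    // Invert the parent links into per-parent child lists.
    let mut children: HashMap<u32, Vec<u32>> = HashMap::new();
    for (&child, &parent) in parent_of {
        if child != parent {
            children.entry(parent).or_default().push(child);
        }
    }
    // Sort the child lists so the resulting order is deterministic.
    for kids in children.values_mut() {
        kids.sort_unstable();
    }

    let mut seen = HashSet::new();
    let mut order = Vec::new();
    let mut stack = vec![root];
    while let Some(id) = stack.pop() {
        // Guard against revisiting a node if the input is not a tree.
        if !seen.insert(id) {
            continue;
        }
        order.push(id);
        if let Some(kids) = children.get(&id) {
            // Push in reverse so the smallest child is popped first.
            stack.extend(kids.iter().rev().copied());
        }
    }
    order
}
```

Decoding in this order would mean a child's parent is always available by the time the child is decoded. It says nothing about contexts that do not come from the decoder.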

petrochenkov · 2024-09-14T06:29:22Z

> SyntaxContexts form a tree structure

Right now they are not a tree, because the `opaque` and `opaque_and_semitransparent` fields are caches that often refer to the context itself.
With #129827, `SyntaxContextKey`s will probably form a tree. They should, but I'm not sure that proc macro or built-in macro logic cannot mess something up, so this also needs to be verified with a bunch of asserts.
So the whole idea will be easier to implement after #129827.
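
A small illustration of the point about self-referring caches, as a sketch rather than the real data layout. `CtxtLinks` and `forms_tree` are hypothetical; `parent` and `opaque` mirror the field names mentioned above, reduced to plain indices. Following `parent` always reaches the root, while following `opaque` can get stuck on an entry that points at itself.

```rust
/// Hypothetical, trimmed-down context record: `parent` points towards the
/// root, `opaque` is a cached link to a related context.
struct CtxtLinks {
    parent: usize,
    opaque: usize,
}

/// True if following `field` from every entry reaches index 0 (the root),
/// i.e. the chosen links form a tree rooted at 0.
fn forms_tree(table: &[CtxtLinks], field: fn(&CtxtLinks) -> usize) -> bool {
    (0..table.len()).all(|start| {
        let mut cur = start;
        let mut steps = 0;
        while cur != 0 {
            cur = field(&table[cur]);
            steps += 1;
            if steps > table.len() {
                // More steps than entries: we are looping, not climbing.
                return false;
            }
        }
        true
    })
}
```

For a table where entry 1 has `opaque: 1`, `forms_tree(&table, |c| c.parent)` can hold while `forms_tree(&table, |c| c.opaque)` does not, which is why the full data cannot simply be walked as a tree.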

petrochenkov · 2024-09-14T06:39:41Z

FIXME: The holes left by the decoder break the logic that assigns `$crate` names in `fn update_dollar_crate_names`. This is not too important because it only affects pretty printing, but it would still be better to fix it.
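
One way holes could break such a pass, sketched under a stated assumption: suppose (hypothetically) that pending entries are found by scanning backwards from the end of the table until the first entry that no longer looks pending. `Entry`, `pending_tail` and `pending_total` are made-up names, and this is not a description of what `update_dollar_crate_names` actually does.

```rust
/// Hypothetical table entry: `name` is `None` while the `$crate` name is
/// still pending and `Some(..)` once assigned. A hole left by the decoder
/// is modelled here as an entry that does not look pending.
#[derive(Clone, Debug, PartialEq)]
struct Entry {
    name: Option<String>,
}

/// Count pending entries by scanning backwards until the first entry that
/// is not pending. Only correct if pending entries form a contiguous tail.
fn pending_tail(table: &[Entry]) -> usize {
    table.iter().rev().take_while(|e| e.name.is_none()).count()
}

/// Count every pending entry, regardless of position.
fn pending_total(table: &[Entry]) -> usize {
    table.iter().filter(|e| e.name.is_none()).count()
}
```

If a hole sits between two pending entries, the backwards scan stops at the hole and undercounts, so the earlier pending entry would never receive its name.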

petrochenkov · 2024-09-14T06:46:03Z

> and then decode the children in the proper order?

Ah, there's one more thing - not all contexts are coming from the decoder (during incremental compilation at least).

Many contexts come from the freshly redone compilation (which is typically done before incremental decoding starts) and then they need to "unify" with equivalent contexts coming from decoding - that's where the duplicates were coming from before this PR.

So even if all decoding is done in proper order, you can still decode and get a context that is equivalent to one of the freshly built ones, but you don't know it until you decode it and compare.

Maybe if #129827 eliminates recursion we'll be able to avoid reserving `SyntaxContext`s in advance, and then there will be no holes.
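
The reserve, decode, then unify sequence described above can be sketched roughly as follows. This is an illustration under assumptions, not the PR's implementation: `DecodeTable`, `Data`, `fresh`, `reserve`, `finish` and `holes` are all hypothetical names. A slot is reserved before the data is known; once decoded, the data is compared against existing entries, and if an equivalent one exists the reserved slot is abandoned and stays behind as a hole.

```rust
use std::collections::HashMap;

/// Hypothetical, trimmed-down context data.
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
struct Data {
    outer_expn: u32,
    parent: u32,
}

#[derive(Default)]
struct DecodeTable {
    /// `None` marks a slot that is reserved or was abandoned (a hole).
    slots: Vec<Option<Data>>,
    index: HashMap<Data, usize>,
}

impl DecodeTable {
    /// Add a context created by the current compilation session.
    fn fresh(&mut self, d: Data) -> usize {
        if let Some(&i) = self.index.get(&d) {
            return i;
        }
        let i = self.slots.len();
        self.slots.push(Some(d.clone()));
        self.index.insert(d, i);
        i
    }

    /// Reserve a slot before the decoded data is available.
    fn reserve(&mut self) -> usize {
        self.slots.push(None);
        self.slots.len() - 1
    }

    /// Finish decoding: reuse an equivalent existing entry if there is one
    /// (leaving the reserved slot empty), otherwise fill the reserved slot.
    fn finish(&mut self, reserved: usize, d: Data) -> usize {
        if let Some(&existing) = self.index.get(&d) {
            return existing;
        }
        self.slots[reserved] = Some(d.clone());
        self.index.insert(d, reserved);
        reserved
    }

    /// Number of slots that never received data.
    fn holes(&self) -> usize {
        self.slots.iter().filter(|s| s.is_none()).count()
    }
}
```

Uniqueness is preserved because `finish` always returns the pre-existing index for equal data. The cost is the empty slot, which is the hole mentioned in the FIXME; if slots did not have to be reserved up front, the lookup could happen before any slot is allocated.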

petrochenkov added 2 commits September 13, 2024 23:06

hygiene: Asserts, comments, code cleanup

978d4f7

hygiene: Ensure uniqueness of SyntaxContextDatas

e577b7a

rustbot assigned TaKO8Ki Sep 13, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Sep 13, 2024

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Sep 13, 2024

petrochenkov removed the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Sep 13, 2024

petrochenkov mentioned this pull request Sep 13, 2024

perform less decoding if it has the same syntax context #129827

Open

cjgillot self-assigned this Sep 13, 2024

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Sep 14, 2024

petrochenkov added the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Sep 14, 2024

petrochenkov unassigned TaKO8Ki Sep 14, 2024

alex-semenyuk added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Nov 1, 2024
