Make too-large byte sizes a validity error in `Layout` #99134

scottmcm · 2022-07-10T22:44:07Z

Now that #95295 made sizes over isize::MAX disallowed in Layout, here's a PR to make it a validity error, and thus something that CTFE/MIRI can catch, like was done for alignment in #95361.

r? @Mark-Simulacrum
cc @CAD97

library/core/src/mem/valid_size.rs

CAD97 · 2022-07-10T22:56:11Z

I worry that this might have negative compiletime impact (c.f. #99117 and just how touchy this code is to insignificant changes), but there's no way to tell without trying it.

src/test/ui/consts/std/alloc.32bit.stderr

library/core/src/mem/valid_size.rs

scottmcm · 2022-07-11T01:05:52Z

This doesn't touch monomorphized methods, and all of it should be inlined while compiling core by LLVM (or maybe even MIR, now!) since it's all runtime nothing, so hopefully it'll not have perf impact. But let's check:

@bors try @rust-timer queue

rust-timer · 2022-07-11T01:05:55Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2022-07-11T01:06:01Z

⌛ Trying commit b34f5b9 with merge 31f194589df939278844135001202a5fb3a984dd...

bors · 2022-07-11T02:29:38Z

☀️ Try build successful - checks-actions
Build commit: 31f194589df939278844135001202a5fb3a984dd (31f194589df939278844135001202a5fb3a984dd)

rust-timer · 2022-07-11T02:29:40Z

Queued 31f194589df939278844135001202a5fb3a984dd with parent c396bb3, future comparison URL.

rust-timer · 2022-07-11T03:46:52Z

Finished benchmarking commit (31f194589df939278844135001202a5fb3a984dd): comparison url.

Instruction count

Primary benchmarks: mixed results
Secondary benchmarks: mixed results

	mean¹	max	count²
Regressions 😿 (primary)	0.8%	0.8%	2
Regressions 😿 (secondary)	0.4%	0.4%	2
Improvements 🎉 (primary)	-0.6%	-0.8%	6
Improvements 🎉 (secondary)	-2.6%	-5.6%	5
All 😿🎉 (primary)	-0.3%	-0.8%	8

Max RSS (memory usage)

Results

Primary benchmarks: 🎉 relevant improvement found
Secondary benchmarks: 🎉 relevant improvements found

	mean¹	max	count²
Regressions 😿 (primary)	N/A	N/A	0
Regressions 😿 (secondary)	N/A	N/A	0
Improvements 🎉 (primary)	-4.8%	-4.8%	1
Improvements 🎉 (secondary)	-2.4%	-3.5%	4
All 😿🎉 (primary)	-4.8%	-4.8%	1

Cycles

Results

Primary benchmarks: mixed results
Secondary benchmarks: 🎉 relevant improvements found

	mean¹	max	count²
Regressions 😿 (primary)	4.0%	4.0%	1
Regressions 😿 (secondary)	N/A	N/A	0
Improvements 🎉 (primary)	-3.8%	-3.8%	1
Improvements 🎉 (secondary)	-6.5%	-9.5%	7
All 😿🎉 (primary)	0.1%	4.0%	2

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf +perf-regression

the arithmetic mean of the percent change ↩ ↩² ↩³
number of relevant changes ↩ ↩² ↩³

scottmcm · 2022-07-11T04:40:50Z

Perf here looks to me like it's a wash -- about as much improvements as regressions -- so that's good!

And one huge, supposedly significant improvement that I don't trust:

So I think this is good for review.

CAD97 · 2022-07-11T05:08:37Z

(it took me a while to write this so I didn't notice @scottmcm had already posted a minor analysis)

Secondary: the externs stress test saw both the two regressions and the hugely significant boost, to incremental compilation. Odd that this would be the case. Both cases are an estimated cycles benefit, though, so /shrug

Primary: mixed, but more green than red. All primary changes are opt builds; debug is unchanged.

RSS: hey look at that, an unambiguous win, even if somewhat isolated.

I think these results at a minimum support that this change isn't a negative. Changes in this file just tend to be noisier than in others.

Mark-Simulacrum · 2022-07-14T01:35:30Z

I'm worried that we're increasingly making things like this language-UB (at least at a current-implementation level), rather than "just" adding asserts that CTFE/MIRI understand. It seems like catching someone transmuting from/to Layout is probably not the intent here, so adding an ub_checking_assert or something like it feels like a better path here. (I think we could define that in std/core today? We support panic! in CTFE and we have the closure-based dispatch intrinsic to detect CTFE, which I think includes miri evaluation?)

My main concern comes from this adding more noise to our IR and in general being harder to follow for folks working with these types, rather than just asserting in constructors. It also seems unfortunate to send a message that in order to get a little more checking for safety preconditions you have to make the leap to make violating them a full-blown validity error, not just around actual usage, which might escalate what would've been an assert of some kind into just UB. In general, it feels like just because we can define some range on a type doesn't mean that we should make it UB to be outside that range -- it's the equivalent of doing unchecked_add and such in cold code.

To be clear, I'm all for improving the ability of MIRI and other tooling to catch these bugs, but I'm not convinced that this is the right approach. In the long run we're definitely going to want a sanitizer_assert! or similar, I expect, so I think we should invest in that now, rather than running with the (poor, annoying to use, very sharp tool) rustc attr-based hack around its lack.

CAD97 · 2022-07-14T04:32:21Z

That might be a good argument to revert the ValidAlign (is-power-of-two int via enum) change and add ValidSize instead. The support code to define ValidAlign is nontrivial, but ValidSize is a much simpler standard application of #[rustc_valid_scalar_range] and provides essentially the same amount of niche space.

Along those lines, perhaps we could add a #[cfg(miri)] arm to assert_unsafe_precondition!? (cargo miri goes through the runtime codepath. I don't know which -Zunleash-the-miri-inside-of-you takes.)

scottmcm · 2022-07-16T00:15:24Z

I would much rather keep ValidAlign than ValidSize, if I had to choose. Because CAD's change in https://github.com/rust-lang/rust/pull/99136/files#diff-787ccc657785af6ea82114db5d168f11efb1caf9adc8eeeafb39323a2163e2d8R77 depends on having the type boundary for it to be able to make that function safe.

I think looking at just the MIRI detection misses a big part of the value, here. I'd really like to make https://doc.rust-lang.org/nightly/std/alloc/struct.Layout.html#method.align_to take a public ValidAlign of some sort, because the current semantics of "you just get garbage if you pass something that's not a power of two" is distasteful to me.

Admittedly, that doesn't need a validity error for it, specifically, just restricted safe constructors. But honestly once the type exists the work to have the validity attributes is negligible, so I feel like they might as well exist to give LLVM nice !range metadata and such.

I could close this PR and file an ACP for making ValidAlign into a "real" type before revisiting more like this, if you'd like.

Mark-Simulacrum · 2022-07-23T17:40:29Z

I don't have any particular objections to the types existing (ValidSize and ValidAlign, etc.), but more so that it seems quite unfortunate if we need to escalate to validity to get this kind of checking. It seems to me that LLVM isn't really going to cooperate here for now anyway (based on #49572), so it's not clear that we gain much from validity.

Ultimately, I suppose it doesn't matter that much - it's unlikely that users will practically encounter problems from the validity invariant here (vs. a safety invariant), but it just feels icky to me. I think it comes down to the fact that if we have merely a safety invariant, it's much easier for us to catch that at runtime (without miri) via asserts, but with a validity invariant it becomes increasingly likely that it just ends up UB and LLVM optimizes it out entirely. (Sort of like how C & C++ code, and probably Rust too, have problems with assert!(ptr != nullptr) getting deleted on them by optimizers).

It might be that concern isn't really applicable to these types though, which are more constrained in use and less likely to get accidentally dereferenced or otherwise assert validity before the check.

Based solely on the diff, r=me; I'll leave it up to you whether the argument I'm making feels convincing enough that it merits further discussion (probably in a libs meeting?). I'm a little iffy on it myself, only around 60-70% convinced this is going to be painful later. But we can also revert; we're not making stable guarantees here, so maybe that's OK.

scottmcm · 2022-09-06T21:36:14Z

I'll close this for now, and decide whether to come back to it later after maybe proposing libs-api changes.

scottmcm · 2022-09-17T21:14:03Z

I opened an ACP to expose the Alignment newtype to core: rust-lang/libs-team#108

Make too-large byte sizes a validity error in Layout

cfd9ece

rust-highfive assigned Mark-Simulacrum Jul 10, 2022

rustbot added T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Jul 10, 2022

This comment was marked as resolved.

Sign in to view

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jul 10, 2022

CAD97 reviewed Jul 10, 2022

View reviewed changes

library/core/src/mem/valid_size.rs Show resolved Hide resolved

CAD97 reviewed Jul 10, 2022

View reviewed changes

library/core/src/mem/valid_size.rs Outdated Show resolved Hide resolved

CAD97 reviewed Jul 10, 2022

View reviewed changes

src/test/ui/consts/std/alloc.32bit.stderr Outdated Show resolved Hide resolved

fix CI; use assert_unsafe_precondition

b34f5b9

scottmcm force-pushed the bytes-validity-invariant-for-layout branch from cfd03ba to b34f5b9 Compare July 11, 2022 00:10

faptc reviewed Jul 11, 2022

View reviewed changes

library/core/src/mem/valid_size.rs Show resolved Hide resolved

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Jul 11, 2022

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Jul 11, 2022

scottmcm removed the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Jul 11, 2022

scottmcm added the perf-regression-triaged The performance regression has been triaged. label Jul 11, 2022

Mark-Simulacrum added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jul 14, 2022

Mark-Simulacrum added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Jul 16, 2022

Mark-Simulacrum added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jul 23, 2022

scottmcm closed this Sep 6, 2022

scottmcm mentioned this pull request Sep 20, 2022

#95295 causes unsoundness in multiple existing crates #101899

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make too-large byte sizes a validity error in `Layout` #99134

Make too-large byte sizes a validity error in `Layout` #99134

scottmcm commented Jul 10, 2022

This comment was marked as resolved.

CAD97 commented Jul 10, 2022

scottmcm commented Jul 11, 2022

rust-timer commented Jul 11, 2022

bors commented Jul 11, 2022

bors commented Jul 11, 2022

rust-timer commented Jul 11, 2022

rust-timer commented Jul 11, 2022

scottmcm commented Jul 11, 2022

CAD97 commented Jul 11, 2022 •

edited

Loading

Mark-Simulacrum commented Jul 14, 2022

CAD97 commented Jul 14, 2022

scottmcm commented Jul 16, 2022

Mark-Simulacrum commented Jul 23, 2022

scottmcm commented Sep 6, 2022

scottmcm commented Sep 17, 2022

Make too-large byte sizes a validity error in Layout #99134

Make too-large byte sizes a validity error in Layout #99134

Conversation

scottmcm commented Jul 10, 2022

This comment was marked as resolved.

CAD97 commented Jul 10, 2022

scottmcm commented Jul 11, 2022

rust-timer commented Jul 11, 2022

bors commented Jul 11, 2022

bors commented Jul 11, 2022

rust-timer commented Jul 11, 2022

rust-timer commented Jul 11, 2022

Instruction count

Max RSS (memory usage)

Cycles

Footnotes

scottmcm commented Jul 11, 2022

CAD97 commented Jul 11, 2022 • edited Loading

Mark-Simulacrum commented Jul 14, 2022

CAD97 commented Jul 14, 2022

scottmcm commented Jul 16, 2022

Mark-Simulacrum commented Jul 23, 2022

scottmcm commented Sep 6, 2022

scottmcm commented Sep 17, 2022

Make too-large byte sizes a validity error in `Layout` #99134

Make too-large byte sizes a validity error in `Layout` #99134

CAD97 commented Jul 11, 2022 •

edited

Loading