Bump MSRV to 1.36 #1011
Conversation
We can also replace:

```rust
#![cfg_attr(not(feature = "std"), no_std)]

#[cfg(all(feature = "alloc", not(feature = "std")))]
extern crate alloc;
```

with:

```rust
#[cfg(feature = "alloc")]
extern crate alloc;

#[cfg(feature = "std")]
extern crate std;
```
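For illustration, a minimal sketch of a crate root using the proposed pattern (the unconditional `#![no_std]` and the feature names are assumptions for the sketch, not taken from the PR):

```rust
// Hypothetical lib.rs sketch, not the actual rand source.
#![no_std] // compile against core by default

// Link the `alloc` crate whenever the (assumed) "alloc" feature is on.
#[cfg(feature = "alloc")]
extern crate alloc;

// Re-link `std` whenever the (assumed) "std" feature is on; since
// Rust 1.36, a `std` build implies `alloc` is available too.
#[cfg(feature = "std")]
extern crate std;

// Collections then work in both alloc-only and std builds.
#[cfg(feature = "alloc")]
use alloc::vec::Vec;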
Technically this shouldn't affect some crates, e.g. …
Also see rust-secure-code/safety-dance#54. It seems like I broke some code on big endian platforms, let me check...
Lots of nice changes!
Since it wasn't linked, a large chunk of this PR is about these issues: rust-secure-code/safety-dance#54
Could you check the benchmarks please? Specifically relating to the int → bytes and bytes → int conversions (generators).
rand_core/src/impls.rs (Outdated)

```diff
 pub fn next_u32_via_fill<R: RngCore + ?Sized>(rng: &mut R) -> u32 {
     let mut buf = [0; 4];
     rng.fill_bytes(&mut buf);
-    u32::from_le_bytes(buf)
+    u32::from_ne_bytes(buf)
 }
```
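As background for the exchange below, a small standalone example of the difference between the two conversions (this is standard library behaviour, not rand-specific):

```rust
fn main() {
    let buf = [0x01u8, 0x02, 0x03, 0x04];

    // Native-endian: reinterprets the bytes as-is, a no-op on any platform.
    // Yields 0x04030201 on little-endian targets, 0x01020304 on big-endian.
    let ne = u32::from_ne_bytes(buf);

    // Little-endian: yields 0x04030201 on every platform, which implies a
    // byte swap on big-endian targets.
    let le = u32::from_le_bytes(buf);

    // The two agree only on little-endian platforms.
    println!("ne = {:#010x}, le = {:#010x}", ne, le);
}
```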
?
Indeed, the old version didn't byte-swap. But `fill_bytes_via_next` does byte-swap. Weird?
I'm also puzzled.
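For reference, the byte swap being discussed: `fill_bytes_via_next` serializes each `next_u64()` output in little-endian order, whereas the diff above reads the buffer back in native order. A simplified sketch of the fill-from-next approach (hedged; not the verbatim rand_core source):

```rust
use rand_core::RngCore;

// Simplified sketch (not the verbatim rand_core source): every u64 from
// the generator is serialized in little-endian order, so big-endian
// platforms byte-swap here, unlike the native-endian read above.
fn fill_bytes_via_next_sketch<R: RngCore + ?Sized>(rng: &mut R, dest: &mut [u8]) {
    let mut chunks = dest.chunks_exact_mut(8);
    for chunk in &mut chunks {
        chunk.copy_from_slice(&rng.next_u64().to_le_bytes());
    }
    // Handle the short tail (if any) with one more u64.
    let rem = chunks.into_remainder();
    if !rem.is_empty() {
        let n = rem.len();
        rem.copy_from_slice(&rng.next_u64().to_le_bytes()[..n]);
    }
}
```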
Generating bytes is ca. 8% slower, which is not ideal but I think acceptable.

Before (master): …

After (this PR): …
Quick test on my side: 4-9% slower bytes generation from ChaCha; 23% slower from HC128. Other code paths (e.g. bytes from PCG and int outputs) appear unaffected. The default byte-length is 1024; I tested 128 and 12000 with basically identical results (from the latter): …

I'm not really happy about this performance loss. Do we have any better options?
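For anyone reproducing these numbers, the `gen_bytes_*` benchmarks follow roughly this shape (a sketch using the nightly `test` crate; the RNG choice and setup here are illustrative, only the 1024-byte default comes from the discussion):

```rust
#![feature(test)]
extern crate test;

use rand::rngs::StdRng; // illustrative; the thread measures ChaCha and HC128
use rand::{RngCore, SeedableRng};
use test::{black_box, Bencher};

const BYTES_LEN: usize = 1024; // the default byte-length mentioned above

#[bench]
fn gen_bytes(b: &mut Bencher) {
    let mut rng = StdRng::from_entropy();
    let mut buf = [0u8; BYTES_LEN];
    b.iter(|| {
        rng.fill_bytes(&mut buf);
        black_box(&buf);
    });
    // Reporting bytes per iteration makes `cargo bench` print MB/s.
    b.bytes = BYTES_LEN as u64;
}
```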
I think we have to change …

UPD: On a second look, the linked code is not correct (it will not fill "tail" bytes), but I think the general idea can be understood.
@newpavlov Are you talking about …?
Yes.
@newpavlov With your suggestion, I get the same performance as before (without resorting to unsafe code); however, the results are not quite correct: …

Do you know what is wrong? I don't see it.
```rust
fn fill_via_u32(src: &[u32], dst: &mut [u8]) -> usize {
    let mut src = src.iter();
    let mut chunks = dst.chunks_exact_mut(4);
    for (v, chunk) in (&mut src).zip(&mut chunks) {
        chunk.copy_from_slice(&v.to_le_bytes());
    }
    let rem = chunks.into_remainder();
    if rem.len() != 0 {
        let v = src.next().unwrap();
        rem.copy_from_slice(&v.to_le_bytes()[..rem.len()]);
        dst.len()/4 + 1
    } else {
        dst.len()/4
    }
}
```

Unfortunately it adds a panic on the unwrap. We could use …

But honestly, I don't quite like the current approach and I think it's worth experimenting with the one proposed here.
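One possible way to avoid that panic (my sketch, not code from the PR): replace the `unwrap` with an `if let`, so a too-short `src` simply fills fewer bytes instead of panicking, and count the consumed words directly:

```rust
// Sketch of a panic-free variant of the function above. Returns the
// number of u32 words consumed from `src`.
fn fill_via_u32(src: &[u32], dst: &mut [u8]) -> usize {
    let mut src = src.iter();
    let mut chunks = dst.chunks_exact_mut(4);
    let mut consumed = 0;
    for (v, chunk) in (&mut src).zip(&mut chunks) {
        chunk.copy_from_slice(&v.to_le_bytes());
        consumed += 1;
    }
    // Fill the short tail only if another word is actually available,
    // instead of unwrapping unconditionally.
    let rem = chunks.into_remainder();
    if !rem.is_empty() {
        if let Some(v) = src.next() {
            let n = rem.len();
            rem.copy_from_slice(&v.to_le_bytes()[..n]);
            consumed += 1;
        }
    }
    consumed
}
```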
I agree, being able to efficiently provide bitwise randomness seems more future-proof for algorithms that optimize for the consumed entropy, and it makes generating …
At this point, we can go back to using …
@dhardy If you think the performance of the safe code is not acceptable, we can always go back to the old …
@newpavlov If bit- or byte-level counters work better here (with reasonable performance all round), I don't have a problem with that. Also, I don't think changing behaviour in a breaking release before 1.0 is a big deal; as long as we document it, we're doing better than many rand libs. Personally I still feel that allowing loss-less bit-level consumption is not very useful in terms of performance, and even byte-level consumption may not be overall. But I may be wrong.
@dhardy …
I did that; the performance is comparable to before now: …
I think this can be merged now (maybe after squashing)?
The necessary standard library functions were stabilized with Rust 1.34. Our MSRV is 1.36.
This is possible thanks to `alloc` being implied by `std` builds since Rust 1.36.
The results from master, using unsafe code:

```
gen_bytes_chacha12: 2,733,838 ns/iter (+/- 181,694) = 374 MB/s
gen_bytes_chacha20: 4,339,602 ns/iter (+/- 237,793) = 235 MB/s
gen_bytes_chacha8:  1,918,279 ns/iter (+/- 103,581) = 533 MB/s
```

The results of the new code using `chunks_exact_mut` (this commit):

```
gen_bytes_chacha12: 3,049,147 ns/iter (+/- 220,631) = 335 MB/s
gen_bytes_chacha20: 4,645,772 ns/iter (+/- 269,261) = 220 MB/s
gen_bytes_chacha8:  2,214,954 ns/iter (+/- 1,745,600) = 462 MB/s
```

The results of using `chunks_mut` (before this commit):

```
gen_bytes_chacha12: 3,492,109 ns/iter (+/- 164,638) = 293 MB/s
gen_bytes_chacha20: 5,087,706 ns/iter (+/- 249,219) = 201 MB/s
gen_bytes_chacha8:  2,700,197 ns/iter (+/- 524,148) = 379 MB/s
```
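A rough illustration of why the two iteration styles differ in speed (the function names are mine; this is a sketch, not the rand source): `chunks_exact_mut` guarantees every chunk in the loop has the full length, so the copies need no per-iteration length checks, while `chunks_mut` must allow a short final chunk.

```rust
// Illustrative sketch of the two iteration styles being benchmarked.
fn via_chunks_mut(results: &[u64], dst: &mut [u8]) {
    // The final chunk may be shorter than 8 bytes, so every iteration
    // carries a length check on the copy.
    for (chunk, &v) in dst.chunks_mut(8).zip(results) {
        let n = chunk.len();
        chunk.copy_from_slice(&v.to_le_bytes()[..n]);
    }
}

fn via_chunks_exact_mut(results: &[u64], dst: &mut [u8]) {
    let full = dst.len() / 8; // number of complete 8-byte chunks
    let mut chunks = dst.chunks_exact_mut(8);
    // Every chunk in this loop is exactly 8 bytes, which lets the
    // compiler drop the per-iteration bounds checks.
    for (chunk, &v) in (&mut chunks).zip(results) {
        chunk.copy_from_slice(&v.to_le_bytes());
    }
    // The short tail (if any) is handled once, outside the hot loop.
    let rem = chunks.into_remainder();
    if !rem.is_empty() {
        if let Some(&v) = results.get(full) {
            let n = rem.len();
            rem.copy_from_slice(&v.to_le_bytes()[..n]);
        }
    }
}
```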
This time I can't just check the new commits thanks to a rebase, so I'll have to trust 😉
This allows us to get rid of some unsafe code. I did not test whether this affects performance, however.
cc @Shnatsel