Implement `Rng::fill` #35

cloudhead · 2022-09-10T14:28:02Z

Fills a byte slice with random data.

It's too bad there's no generic gen function, but I think in most cases we want to fill byte slices.

notgull · 2022-09-10T14:37:47Z

src/lib.rs

+    #[inline]
+    pub fn fill(&self, slice: &mut [u8]) {
+        for item in slice {
+            *item = self.u8(..);


This implementation could be more efficient. There's probably a way to generate a u64 at a time and serialize that into the byte slice instead of repeatedly generating u8's.

Yes, was thinking about that. Let me look into it.

Ok, check 5e87fd0

Much faster. I don't know if there's a faster way without using unsafe.

Benchmarks:

test fill       ... bench: 29 ns/iter (+/- 5)
test fill_naive ... bench: 343 ns/iter (+/- 33)
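
To make the two approaches compared in this thread concrete, here is a minimal sketch of a per-byte fill next to a chunked fill. `ToyRng` is a hypothetical stand-in (a SplitMix64 step) and is not the crate's generator; `fill_naive` and `fill_chunked` are illustrative free functions, not the code in 5e87fd0.

```rust
/// Hypothetical stand-in generator (one SplitMix64 step per call).
/// It only exists so the example is self-contained; it is NOT fastrand's generator.
pub struct ToyRng(u64);

impl ToyRng {
    pub fn new(seed: u64) -> Self {
        ToyRng(seed)
    }

    pub fn next_u64(&mut self) -> u64 {
        self.0 = self.0.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = self.0;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^ (z >> 31)
    }
}

/// Naive approach: one generator call per output byte.
pub fn fill_naive(rng: &mut ToyRng, slice: &mut [u8]) {
    for item in slice {
        *item = rng.next_u64() as u8;
    }
}

/// Chunked approach: one generator call per 8 output bytes,
/// followed by a per-byte loop for the (at most 7-byte) tail.
pub fn fill_chunked(rng: &mut ToyRng, slice: &mut [u8]) {
    let mut chunks = slice.chunks_exact_mut(8);
    for chunk in chunks.by_ref() {
        chunk.copy_from_slice(&rng.next_u64().to_ne_bytes());
    }
    for item in chunks.into_remainder() {
        *item = rng.next_u64() as u8;
    }
}
```

For a 19-byte buffer the chunked version calls the generator 5 times (two full chunks plus three tail bytes), where the naive version calls it 19 times.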

notgull · 2022-09-10T14:38:37Z

src/lib.rs

+
+    /// Return a byte array of the given size.
+    #[inline]
+    pub fn bytes<const N: usize>(&self) -> [u8; N] {


Won't this be an MSRV bump?

Hmm, if it is, we'll have to decide whether it's worth it or not I guess.

I've removed this function for now, can open another PR for it.
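
For context on the MSRV question: const generics were stabilized in Rust 1.51, so a signature of this shape would not compile on older toolchains. Below is a rough sketch of what such a function could look like; `bytes_from` and its closure parameter are hypothetical and only illustrate the const-generic return type, not the function that was removed from this PR.

```rust
/// Hypothetical helper: builds an `N`-byte array from any byte source.
/// The `const N: usize` parameter is what requires Rust 1.51 or newer.
pub fn bytes_from<F, const N: usize>(mut next_u8: F) -> [u8; N]
where
    F: FnMut() -> u8,
{
    let mut out = [0u8; N];
    for byte in out.iter_mut() {
        *byte = next_u8();
    }
    out
}
```

The array length can be inferred from the binding, for example `let key: [u8; 16] = bytes_from(|| 0);`.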

notgull

I feel like you probably don't even need the copy_from_slice here. You could probably use try_into to cast the slices to &mut [u8; 8] and then just set that to the result of to_ne_bytes. You may also want to align the chunks to the 8-byte boundary to maybe take advantage of aligned writes.

notgull · 2022-09-10T15:39:19Z

src/lib.rs

+        // Filling the buffer in chunks of 8 is much faster.
+        let mut chunks = slice.chunks_exact_mut(8);
+        for items in chunks.by_ref() {
+            let r = self.u64(..);


Can't we use gen_u64() here instead of going through the u64(..) adaptor?

notgull · 2022-09-10T15:40:33Z

src/lib.rs

+        let mut chunks = slice.chunks_exact_mut(8);
+        for items in chunks.by_ref() {
+            let r = self.u64(..);
+            items.copy_from_slice(&r.to_le_bytes());


I think we should use to_ne_bytes() instead in order to avoid a performance hit on big endian systems.
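
As a small illustration of the difference being discussed: `to_le_bytes()` always produces little-endian order, so a big-endian target has to swap bytes, while `to_ne_bytes()` copies the value in whatever order the target already uses. The helper names below are hypothetical.

```rust
/// Returns the bytes of `value` in (little-endian, native-endian) order.
pub fn le_and_ne(value: u64) -> ([u8; 8], [u8; 8]) {
    (value.to_le_bytes(), value.to_ne_bytes())
}

/// True when the compilation target is little-endian, in which case
/// `to_ne_bytes()` and `to_le_bytes()` return the same array.
pub fn native_is_little_endian() -> bool {
    cfg!(target_endian = "little")
}
```

One consequence worth keeping in mind: with a fixed seed, a fill based on `to_ne_bytes()` would produce different byte sequences on little-endian and big-endian targets.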

cloudhead · 2022-09-10T18:27:37Z

Ah, yes, that's indeed possible. However, after trying it out, the benchmark doesn't show any performance difference, so I think the generated code is the same:

    /// Fill a byte slice with random data.
    #[inline]
    pub fn fill(&self, slice: &mut [u8]) {
        // Filling the buffer in chunks of 8 is much faster.
        let mut chunks = slice.chunks_exact_mut(8);
        for items in chunks.by_ref() {
            let r = self.gen_u64();
            let sl: &mut [u8; 8] = items.try_into().unwrap();

            *sl = r.to_ne_bytes();
        }

        for item in chunks.into_remainder() {
            *item = self.u8(..);
        }
    }

What do you think? The only nit with the above code is the unwrap() I guess.
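
On the `unwrap()` nit: `chunks_exact_mut(8)` only ever yields slices of length 8, so the conversion to `&mut [u8; 8]` should not be able to fail in that loop. The sketch below checks that property in isolation; `write_u64` and `all_chunks_convert` are hypothetical helpers written for this illustration, not part of the PR.

```rust
use std::convert::TryFrom;

/// Writes `value` into `chunk` if (and only if) the chunk is exactly 8 bytes.
/// Returns whether the slice-to-array conversion succeeded.
pub fn write_u64(chunk: &mut [u8], value: u64) -> bool {
    match <&mut [u8; 8]>::try_from(chunk) {
        Ok(array) => {
            *array = value.to_ne_bytes();
            true
        }
        Err(_) => false,
    }
}

/// Returns true if every chunk produced by `chunks_exact_mut(8)` converts
/// to `&mut [u8; 8]`. Each full chunk is overwritten with 0xFF bytes;
/// the remainder (fewer than 8 bytes) is left untouched.
pub fn all_chunks_convert(buf: &mut [u8]) -> bool {
    buf.chunks_exact_mut(8)
        .all(|chunk| write_u64(chunk, u64::MAX))
}
```

If the `unwrap()` is still unwanted for readability, the `copy_from_slice` form from the earlier revision avoids it, and the benchmark above suggests the two perform the same.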

notgull

This looks good to me. That being said, it might be above my pay grade to merge this.

@smol-rs/admins Any thoughts on this?

fogti · 2022-09-12T11:14:59Z

I'm not sure if the #[inline] tagging of the function is appropriate, given that it is relatively large. Can we get an LLVM IR of it?

cloudhead · 2022-09-13T08:48:13Z

> I'm not sure if the #[inline] tagging of the function is appropriate, given that it is relatively large. Can we get an LLVM IR of it?

Yeah I agree, I had it there originally when the function was small and unoptimized, but I would remove it now. For the record, the benchmark isn't affected when #[inline] is removed.

cloudhead · 2022-09-13T08:49:29Z

#[inline] now removed.

Don't have time to look into the LLVM stuff, sorry!

cloudhead force-pushed the master branch from 88413c4 to 66b2491 Compare September 10, 2022 14:37

notgull reviewed Sep 10, 2022

cloudhead force-pushed the master branch from 5c6b860 to 992fae3 Compare September 10, 2022 15:15

Implement Rng::fill

5e87fd0

cloudhead force-pushed the master branch from 992fae3 to 5e87fd0 Compare September 10, 2022 15:18

notgull reviewed Sep 10, 2022

notgull approved these changes Sep 12, 2022

fogti approved these changes Sep 12, 2022

Small improvements to Rng::fill

3663e59

cloudhead force-pushed the master branch from 0d24fa0 to 3663e59 Compare September 13, 2022 08:48

fogti merged commit 6fe2c33 into smol-rs:master Sep 13, 2022

taiki-e mentioned this pull request Feb 12, 2023

Remove interior mutability from Rng #47

Merged

notgull mentioned this pull request Feb 12, 2023

v1.9.0 #48

Merged
