Improve stdlib's random float generation #10428
Conversation
If you describe "Over the previous method, this has the advantage of being able to represent much more of the available range.", then you might want to explain this more properly in the comments.
Is there a way to test this stuff?
Is this really still uniform over the range?
I've made some changes:
The goodness of fit test that I added should now verify that the generated values come from a uniformly distributed RNG. The test currently aggregates 100,000 numbers into 1,000 buckets and checks that the p-value is > 0.05; feel free to adjust the numbers to your liking. I personally experimented with 100,000,000 random values and 10,000 buckets and got p-values between 0.2 and 0.7, depending on the RNG seed, but that took quite a few seconds for me to run.
Yes, it should be. For a given exponent value, the distribution of mantissa bits should be uniform, so the mantissa bits can be directly extracted from the RNG. The exponent value is exponentially distributed, since each successive exponent spans double the range of the previous one. In the [0, 1) interval for the
CI failure is due to timeout, but the relevant tests pass.
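The bucketed goodness-of-fit check described above can be sketched in Python (a hypothetical translation; the actual test lives in the Zig standard library). Lacking a p-value routine, a crude normal approximation to the chi-square distribution stands in for the p-value > 0.05 check:

```python
import random

def chi_square_stat(samples, n_buckets):
    """Chi-square goodness-of-fit statistic against a uniform [0, 1) law."""
    counts = [0] * n_buckets
    for x in samples:
        counts[int(x * n_buckets)] += 1
    expected = len(samples) / n_buckets
    return sum((c - expected) ** 2 / expected for c in counts)

# Mirror the numbers from the test: 100,000 samples into 1,000 buckets.
n, k = 100_000, 1_000
random.seed(42)  # arbitrary seed, for reproducibility
stat = chi_square_stat([random.random() for _ in range(n)], k)

# Under the null hypothesis the statistic is ~ chi2(k - 1), which has
# mean k - 1 and variance 2(k - 1); a wide 5-sigma bound on the statistic
# stands in here for the real p-value computation.
mean, sd = k - 1, (2 * (k - 1)) ** 0.5
assert abs(stat - mean) < 5 * sd, stat
```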
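The claim above that the exponent is exponentially distributed can be checked empirically: each binade [2^-(e+1), 2^-e) should hold roughly half as many samples as the one above it. A hypothetical Python demo using the stdlib RNG:

```python
import random

random.seed(1)
n = 1_000_000
binades = [0] * 6  # slot e counts samples in [2**-(e+1), 2**-e)
for _ in range(n):
    x = random.random()
    e = 0
    while x < 0.5 and e < 5:  # find the binade by repeated doubling
        x *= 2.0
        e += 1
    binades[e] += 1

# Each binade should hold about half the samples of the previous one.
# (Slot 5 is a catch-all for everything below 2**-5, so it is excluded.)
for e in range(4):
    ratio = binades[e + 1] / binades[e]
    assert 0.45 < ratio < 0.55, (e, ratio)
```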
lib/std/rand.zig (Outdated)
@@ -15,6 +15,8 @@ const math = std.math;
 const ziggurat = @import("rand/ziggurat.zig");
 const maxInt = std.math.maxInt;
+
+const Dilbert = @import("rand/Dilbert.zig");
If you include _test.zig in the file name, it won't get installed as part of the standard library.
if (rand_lz == 41) {
    rand_lz += @clz(u64, r.int(u64));
    if (rand_lz == 41 + 64) {
I imagine this branching may have a significant speed penalty; has this been benchmarked at all?
Tested std.rand.Random.float(f32): old version vs. new version without the branch vs. new version with the branch. Units are nanoseconds per call in the results below:
Non-inlined Random.float(f32):
old: mean = 11.0096, std_dev = 0.2327
new, no branching: mean = 13.0287, std_dev = 0.1954 (18.3% slower)
new, with branching: mean = 13.8511, std_dev = 0.2545 (25.8% slower)
Inlined Random.float(f32):
old: mean = 1.7341, std_dev = 0.0437
new, no branching: mean = 1.8877, std_dev = 0.0081 (8.8% slower)
new, with branching: mean = 2.3880, std_dev = 0.0118 (37.7% slower)
I experimented with changing the comparison so the CPU could compute it in parallel with @clz and shorten the dependency chain:
const rand = r.int(u64);
var rand_lz = @clz(u64, rand | 0x7FFFFF);
- if (rand_lz == 41) {
+ if ((rand | 0x7FFFFF) == 0x7FFFFF) {
This made the inline case faster, but the non-inline case slower:
Non-inlined Random.float(f32):
new, with updated branching: mean = 14.7841, std_dev = 0.0527 (34.2% slower)
Inlined Random.float(f32):
new, with updated branching: mean = 2.3205, std_dev = 0.0093 (33.8% slower)
I might dig into this a little further with perf.
if (rand_lz == 41) {
    rand_lz += @clz(u64, r.int(u64));
    if (rand_lz == 41 + 64) {
        // It is astronomically unlikely to reach this point.
Then wouldn't it be good to use @setCold or something here? It probably doesn't work here, though, because this is not in a function. Maybe if you extract this part into an inline function and use @setCold(true) there?
Ah, I didn't know about those. Yes, those seem perfect for this situation.
// It is astronomically unlikely to reach this point.
// TODO: when #5177 or #489 is implemented,
// tell the compiler it is astronomically unlikely to reach this point.
So something I thought of today is that this opens up a timing attack that wasn't there previously: certain numbers will take different amounts of time to generate.
I’m not sure what use case requires cryptographically secure generation of floating point values, but fair point.
Don't worry about it. Floating point representation is never used in cryptography. The only exception I can think of abuses floating point registers as additional registers to store 53-bit integers; it doesn't use an actual floating point representation.
Branch updated from 7dba763 to 7bedeb9.
Thank you for the enhancement!
Status quo is that to generate a random value in the half open interval [0, 1), a random value in [1, 2) is generated via bit fiddling, and then 1 is subtracted.
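The status-quo trick can be sketched in Python (a hypothetical translation; the Zig code does the equivalent bit fiddling on the IEEE-754 representation):

```python
import random
import struct

def old_float32():
    # Set the exponent field so the value lands in [1, 2), fill the 23
    # mantissa bits with random data, reinterpret as f32, then subtract 1.
    bits = 0x3F800000 | random.getrandbits(23)  # 0x3F800000 encodes 1.0
    return struct.unpack('<f', struct.pack('<I', bits))[0] - 1.0

random.seed(0)
values = [old_float32() for _ in range(1000)]
assert all(0.0 <= v < 1.0 for v in values)
```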
The problem is that, in IEEE-754, the number of distinct values in [1, 2) is significantly smaller than that of [0, 1). In the single-precision case, only 1/127 of the available range is covered. In the double-precision case, the ratio is even less, only 1/1023.
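The 1/127 figure can be verified by counting bit patterns: for positive finite f32 values, consecutive bit patterns encode consecutive floats, so subtracting patterns counts the representable values in an interval:

```python
import struct

def f32_bits(x):
    # Reinterpret an f32 as its 32-bit pattern.
    return struct.unpack('<I', struct.pack('<f', x))[0]

in_1_2 = f32_bits(2.0) - f32_bits(1.0)  # representable f32s in [1, 2)
in_0_1 = f32_bits(1.0) - f32_bits(0.0)  # representable f32s in [0, 1)
print(in_1_2, in_0_1, in_0_1 // in_1_2)  # → 8388608 1065353216 127
```

The interval [1, 2) is a single binade of 2^23 values, while [0, 1) spans 126 normal binades plus the subnormals and zero, giving exactly 127 times as many values.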
This commit fixes these shortcomings by directly generating the mantissa and exponent. The mantissa bits are extracted directly from the RNG, and @clz is used to apply an exponential bias to the exponent bits. With the new implementation, the available range increases significantly.
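Putting the pieces together, the approach can be sketched in Python for f32 (a hypothetical translation of the Zig code; clz64 and the 0x7FFFFF mask mirror the snippets quoted in the review above):

```python
import random
import struct

def clz64(x):
    # Leading zeros of a 64-bit value; int.bit_length(0) == 0, so clz64(0) == 64.
    return 64 - x.bit_length()

def random_f32():
    r = random.getrandbits(64)
    mantissa = r & 0x7FFFFF      # low 23 bits become the mantissa, used directly
    lz = clz64(r | 0x7FFFFF)     # leading zeros among the top 41 bits (capped at 41)
    if lz == 41:                 # all 41 high bits were zero: extend with more bits
        lz += clz64(random.getrandbits(64))
        if lz == 41 + 64:
            return 0.0           # astronomically unlikely to reach this point
    # Each extra leading zero halves the interval, so the biased exponent
    # 126 - lz places the value in the binade [2**-(lz+1), 2**-lz).
    exponent = 126 - lz
    bits = (exponent << 23) | mantissa
    return struct.unpack('<f', struct.pack('<I', bits))[0]

random.seed(0)
samples = [random_f32() for _ in range(10_000)]
assert all(0.0 <= x < 1.0 for x in samples)
```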