Add accuracy note to gen_bool #347

pitdicker · 2018-03-28T06:37:20Z

A first change to the documentation in reply to @huonw's comment on Reddit.

I don't think adding an extra check for 0.0 in gen_bool is worth it. If p is variable and the code expects to receive both true and false, there is no problem, we are just at the limit of the accuracy. If the goal is to consistently generate false by using 0.0 as a constant, why are you using gen_bool?

dhardy

Alternatively maybe we should use HighPrecision01 here and add a gen_bool_const variant with this method? I don't really like having two methods for the same job, but each has its advantage/disadvantage.

dhardy · 2018-03-28T07:48:04Z

src/lib.rs

+    /// # Accuracy note
+    ///
+    /// `gen_bool` uses 32 bits of the RNG, so if you use it to generate close
+    /// to or more than 2^32 results, a tiny bias may becomes noticable.


dhardy · 2018-03-28T07:51:30Z

src/lib.rs

+    /// `gen_bool` uses 32 bits of the RNG, so if you use it to generate close
+    /// to or more than 2^32 results, a tiny bias may becomes noticable.
+    /// A notable consequence of the method used here is that the worst case is
+    /// `rng.gen_bool(0.0)`: it has a chance of 1 in 2^32 of being true, while


Sorry, I told you to use <= after examining the upper bound but not the lower. I guess it's better to use < then and ensure gen_bool(0.0) is correct.

Then we would shift to problem to gen_bool(1.0). But I see our implementation as reasonable. We could alternatively just always return false for gen_bool(0.0). But I wouldn't bother, using gen_bool like this is really nor very sensible (as I tried to wr3ite in the first comment, and the comment in the code).

To top it off, some generators are explicitly designed not to produce 0. We could use (rng.gen() - 1) < p_int, but I guess that's slower?

😄 Good point. Nothing seems perfect any more once you know too many details...

That would have to use a wrapping subtraction, and would also only shift the problem from 0.0 to 1.0.

No, it should use rejection sampling — but then there's no point shifting. I agree, we're thinking too much into this 😆

pitdicker · 2018-03-28T10:32:52Z

I am starting to think that implementing a different method as the Bernoulli distribution could make sense, but would like to keep gen_bool as it is.

pitdicker · 2018-03-28T18:23:16Z

Ready to merge? (seems like the latest nightly broke stdweb, and somehow we try to use stdweb on two builders now... #352)

dhardy · 2018-03-29T08:30:43Z

Well, apart from the spelling error, it's an improvement, though I'm not entirely happy with it.

pitdicker · 2018-03-29T08:35:32Z

I remember changing and pushing that. No idea what happened 😄.

What is the part you are not entirely happy with? That there is a situation where it is not perfect?

dhardy · 2018-03-29T08:40:40Z

Yes. And that we have a more accurate approach which is just as fast (according to my benchmarking) when p is not statically known (which is commonly the case). But I guess we can get to that later.

pitdicker · 2018-03-29T08:50:21Z

The way I see it is that with an accuracy of 32 bits, we can be off by 2^-33. And with rounding, this becomes visible in the 0.0 case.

when p is not statically known (which is commonly the case).

I was thinking just the opposite 😄, thinking about the current uses of gen_weighted_bool.

But what did you think about also adding a bernoulli distribution to add the more precise method?

dhardy · 2018-03-29T09:45:10Z

I was thinking of sim work I've done in the past where many probabilities are fixed, but read from config files, and many more are derived. But I don't know what the average use is!

What, gen_bool and gen_bernoulli? We could I guess. Both are no more than a couple lines of code, and the method currently used is the only one with any potential for pre-computation.

pitdicker · 2018-03-29T10:18:48Z

No, I didn't have an extra method on Rng in mind. Weren't you once thinking about adding a Bernoulli distribution?

Add accuracy note to gen_bool

dhardy reviewed Mar 28, 2018

View reviewed changes

Add accuracy note to gen_bool

ca4722b

pitdicker force-pushed the gen_bool_accuracy_note branch from 26ec612 to ca4722b Compare March 29, 2018 08:34

dhardy merged commit 683d6b9 into rust-random:master Apr 1, 2018

pitdicker deleted the gen_bool_accuracy_note branch April 1, 2018 17:30

pitdicker mentioned this pull request Apr 2, 2018

Add Bernoulli distribution #300

Closed

pitdicker pushed a commit that referenced this pull request Apr 4, 2018

Merge pull request #347 from pitdicker/gen_bool_accuracy_note

235b7d1

Add accuracy note to gen_bool

dhardy mentioned this pull request Jun 6, 2018

Add API for getting a bool with chance of exactly 1-in-10 or 2-in-3 #491

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add accuracy note to gen_bool #347

Add accuracy note to gen_bool #347

pitdicker commented Mar 28, 2018

dhardy left a comment

dhardy Mar 28, 2018

dhardy Mar 28, 2018

pitdicker Mar 28, 2018

dhardy Mar 28, 2018

pitdicker Mar 28, 2018 •

edited

Loading

dhardy Mar 28, 2018

pitdicker commented Mar 28, 2018

pitdicker commented Mar 28, 2018

dhardy commented Mar 29, 2018

pitdicker commented Mar 29, 2018

dhardy commented Mar 29, 2018

pitdicker commented Mar 29, 2018

dhardy commented Mar 29, 2018 •

edited

Loading

pitdicker commented Mar 29, 2018

Add accuracy note to gen_bool #347

Add accuracy note to gen_bool #347

Conversation

pitdicker commented Mar 28, 2018

dhardy left a comment

Choose a reason for hiding this comment

dhardy Mar 28, 2018

Choose a reason for hiding this comment

dhardy Mar 28, 2018

Choose a reason for hiding this comment

pitdicker Mar 28, 2018

Choose a reason for hiding this comment

dhardy Mar 28, 2018

Choose a reason for hiding this comment

pitdicker Mar 28, 2018 • edited Loading

Choose a reason for hiding this comment

dhardy Mar 28, 2018

Choose a reason for hiding this comment

pitdicker commented Mar 28, 2018

pitdicker commented Mar 28, 2018

dhardy commented Mar 29, 2018

pitdicker commented Mar 29, 2018

dhardy commented Mar 29, 2018

pitdicker commented Mar 29, 2018

dhardy commented Mar 29, 2018 • edited Loading

pitdicker commented Mar 29, 2018

pitdicker Mar 28, 2018 •

edited

Loading

dhardy commented Mar 29, 2018 •

edited

Loading