Replace use of crypto/rand with more direct use of OS RNG #58

briansmith · 2015-12-30T05:45:38Z

In particular, consider rand::os::OsRng. If rand::os::OsRng is insufficient in some way then try to fix it. rand::os::OsRng is preferable because other Rust code is likely using it, and we can avoid duplicate code. Also, crypto/rand is pretty complicated and has a weird interface. To the extent that that complication is useful, we should implement it as a layer over rand::os::OsRng.


briansmith · 2015-12-30T05:57:12Z

One problem is that OsRng implements Rng, and Rng has a LOT of methods, in particular methods that take floating-point parameters and many methods that we'd never, ever use. It's not clear how the presence of floating-point types in the trait interface affects platforms where we want to avoid floating-point code. It's also not clear how good a job the compiler does of removing the extraneous methods. Perhaps we should propose a new, simpler interface that is just rand::os::OsRng::fill_bytes.
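To make the "just fill_bytes" idea concrete, here is a minimal sketch of what such an interface could look like. The names `SecureRandom`, `Unspecified`, `FixedBytes`, and `generate_nonce` are hypothetical and chosen for illustration only; this is not the API of the rand crate or of ring.

```rust
/// Opaque error type (hypothetical): callers only learn that filling failed.
#[derive(Debug, PartialEq)]
pub struct Unspecified;

/// Hypothetical single-method interface: no floating point, no generics,
/// nothing beyond "fill this buffer with random bytes".
pub trait SecureRandom {
    fn fill_bytes(&self, dest: &mut [u8]) -> Result<(), Unspecified>;
}

/// Deterministic stand-in used only to show how small an implementation
/// can be. It is NOT random and must never be used for real keys.
pub struct FixedBytes(pub u8);

impl SecureRandom for FixedBytes {
    fn fill_bytes(&self, dest: &mut [u8]) -> Result<(), Unspecified> {
        for b in dest.iter_mut() {
            *b = self.0;
        }
        Ok(())
    }
}

/// Example consumer: it depends only on the one-method trait.
pub fn generate_nonce(rng: &dyn SecureRandom) -> Result<[u8; 12], Unspecified> {
    let mut nonce = [0u8; 12];
    rng.fill_bytes(&mut nonce)?;
    Ok(nonce)
}
```

With an interface this small, an OS-backed implementation and a test double both need only one method, and nothing in the trait mentions floating-point types.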

briansmith · 2016-01-05T02:33:23Z

See also https://boringssl.googlesource.com/boringssl/+/dca63cfa754503d75c615f5c134d9334f54c132b.

briansmith · 2016-02-01T19:35:59Z

See rust-lang/rust#27703 (comment)

briansmith · 2016-04-25T21:44:05Z

I've started a new implementation of ring::rand where more of the logic is in Rust.

It is now possible to explicitly manage when the file handle for /dev/urandom is open by constructing/destroying rand::SystemRandom instances appropriately.
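As a rough illustration of tying the file handle's lifetime to a value, the sketch below opens /dev/urandom on construction and closes it when the value is dropped. `UrandomHandle` is a hypothetical name, and this is not ring's actual `SystemRandom` implementation; it assumes a Unix-like system where /dev/urandom exists.

```rust
use std::fs::File;
use std::io::{self, Read};

/// Hypothetical wrapper: owns one open descriptor for /dev/urandom.
/// The descriptor is closed automatically when the value is dropped.
pub struct UrandomHandle {
    urandom: File,
}

impl UrandomHandle {
    /// Opens /dev/urandom once; later calls to `fill` reuse the descriptor.
    pub fn open() -> io::Result<Self> {
        Ok(UrandomHandle {
            urandom: File::open("/dev/urandom")?,
        })
    }

    /// Fills `dest` completely or returns an error.
    pub fn fill(&self, dest: &mut [u8]) -> io::Result<()> {
        // `Read` is implemented for `&File`, so a shared reference suffices.
        (&self.urandom).read_exact(dest)
    }
}
```

An application that wants exactly one descriptor can create a single `UrandomHandle` and share a reference to it; one that wants the descriptor closed early can simply drop the value.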

Most applications should have only a single instance of rand::SystemRandom, to avoid the waste of having multiple file descriptors to /dev/urandom open and to avoid the overhead of opening/closing /dev/urandom. That is effectively how BoringSSL works and how ring worked before these changes. However, I didn't implement that because, generally, we shouldn't be using /dev/urandom anyway. Instead, we should be using RDRAND (or equivalents on ARM and other platforms) + PRP (e.g. ChaCha20), or the getrandom syscall (or the equivalent on other platforms), for performance reasons. The /dev/urandom implementation should only exist as a least-common-denominator fallback.

I also dropped the RDRAND+ChaCha20 implementation.

So, we should:

Make SystemRandom use getrandom on Linux whenever possible. This is issue #148 (Use the getrandom system call on Linux when it is available in CRYPTO_sysrand).
Add support for the equivalent of getrandom on Mac OS X, BSD, etc. to SystemRandom (it is already done on Windows). This is issue #149 (Improve random number generation on Mac OS X and iOS).
Provide some way to avoid the fallback to /dev/urandom, for cases where we know that one of the better options will always be available.
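The fallback policy in the list above could be structured roughly as follows. This is a sketch under assumptions: `Fallback` and `fill_with_fallback` are hypothetical names, and the preferred source (for example a getrandom wrapper) is passed in as a closure rather than implemented here, so no real syscall binding is shown.

```rust
use std::fs::File;
use std::io::{self, Read};

/// Hypothetical policy switch: whether /dev/urandom may be used when the
/// preferred source is unavailable.
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum Fallback {
    Allowed,
    Disabled,
}

/// Tries `primary` first (a placeholder for a getrandom-style source).
/// On failure, either reports the error or falls back to /dev/urandom,
/// depending on `fallback`.
pub fn fill_with_fallback<P>(primary: P, fallback: Fallback, dest: &mut [u8]) -> io::Result<()>
where
    P: FnOnce(&mut [u8]) -> io::Result<()>,
{
    match primary(dest) {
        Ok(()) => Ok(()),
        // Callers who know a better source is always present want the
        // failure surfaced instead of silently masked by the fallback.
        Err(e) if fallback == Fallback::Disabled => Err(e),
        // Least-common-denominator path (Unix-like systems only).
        Err(_) => File::open("/dev/urandom")?.read_exact(dest),
    }
}
```

Making the policy an explicit argument (or a compile-time switch) means a deployment that always has the better source never touches /dev/urandom, even on error paths.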

As far as the RDRAND + PRP implementation goes: The BoringSSL implementation was kinda-sorta-fork/vm-duplication-safe, but in some ways it actually makes that kind of problem worse. Instead, we should encourage applications to use safer patterns. For example, it seems like applications would be best off using SystemRandom for everything by default, but whenever they need a PRNG for something associated to a socket, they can use a PRF (perhaps in conjunction with RDRAND or equivalents) from a SystemRandom-generated seed to get the performance boost. By tying the CSPRNG state to a socket, the application should be able to avoid all the VM duplication and forking issues safely. (I think Amazon's s2n does this.)
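A minimal sketch of the per-socket pattern, assuming ChaCha20 (the block function from RFC 8439) as the PRF and a 32-byte seed obtained from the system RNG when the connection is created. `ConnectionRng` is a hypothetical name; the hand-written ChaCha20 here is for illustration only, and real code should use a vetted implementation. It does not mix in RDRAND.

```rust
/// One ChaCha20 quarter round on four words of the state.
fn quarter_round(s: &mut [u32; 16], a: usize, b: usize, c: usize, d: usize) {
    s[a] = s[a].wrapping_add(s[b]); s[d] = (s[d] ^ s[a]).rotate_left(16);
    s[c] = s[c].wrapping_add(s[d]); s[b] = (s[b] ^ s[c]).rotate_left(12);
    s[a] = s[a].wrapping_add(s[b]); s[d] = (s[d] ^ s[a]).rotate_left(8);
    s[c] = s[c].wrapping_add(s[d]); s[b] = (s[b] ^ s[c]).rotate_left(7);
}

/// ChaCha20 block function: 32-byte key, 32-bit counter, 96-bit nonce.
pub fn chacha20_block(key: &[u8; 32], counter: u32, nonce: &[u8; 12]) -> [u8; 64] {
    let mut init = [0u32; 16];
    // "expand 32-byte k" constants.
    init[..4].copy_from_slice(&[0x6170_7865, 0x3320_646e, 0x7962_2d32, 0x6b20_6574]);
    for i in 0..8 {
        init[4 + i] = u32::from_le_bytes([key[4 * i], key[4 * i + 1], key[4 * i + 2], key[4 * i + 3]]);
    }
    init[12] = counter;
    for i in 0..3 {
        init[13 + i] = u32::from_le_bytes([nonce[4 * i], nonce[4 * i + 1], nonce[4 * i + 2], nonce[4 * i + 3]]);
    }
    let mut s = init;
    for _ in 0..10 {
        // Column rounds, then diagonal rounds (20 rounds in total).
        quarter_round(&mut s, 0, 4, 8, 12);
        quarter_round(&mut s, 1, 5, 9, 13);
        quarter_round(&mut s, 2, 6, 10, 14);
        quarter_round(&mut s, 3, 7, 11, 15);
        quarter_round(&mut s, 0, 5, 10, 15);
        quarter_round(&mut s, 1, 6, 11, 12);
        quarter_round(&mut s, 2, 7, 8, 13);
        quarter_round(&mut s, 3, 4, 9, 14);
    }
    let mut out = [0u8; 64];
    for i in 0..16 {
        out[4 * i..4 * i + 4].copy_from_slice(&s[i].wrapping_add(init[i]).to_le_bytes());
    }
    out
}

/// Hypothetical per-connection generator. The seed must come from the
/// system RNG at connection setup; the state lives and dies with the socket.
pub struct ConnectionRng {
    key: [u8; 32],
    counter: u32,
}

impl ConnectionRng {
    pub fn from_seed(seed: [u8; 32]) -> Self {
        ConnectionRng { key: seed, counter: 0 }
    }

    /// Fills `dest` with keystream. Unused bytes of a partial final block
    /// are discarded rather than buffered.
    pub fn fill(&mut self, dest: &mut [u8]) {
        for chunk in dest.chunks_mut(64) {
            let block = chacha20_block(&self.key, self.counter, &[0u8; 12]);
            self.counter = self.counter.checked_add(1).expect("keystream exhausted; reseed");
            chunk.copy_from_slice(&block[..chunk.len()]);
        }
    }
}
```

Because each connection gets its own freshly seeded state when it is set up, the fast path does not depend on a process-wide generator that could be copied by a fork or VM snapshot. That is the property the paragraph above is after.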

briansmith · 2016-04-29T21:50:41Z

Here's the relevant comment from #147:

The source of RAND_bytes notes:
/* Without a hardware RNG to save us from address-space duplication, the OS
* entropy is used directly. */
This protection is racy in the sense that the address space could be duplicated between the time that hwrand is called and the time the function returns. The more random data we're generating, the bigger this window of vulnerability becomes. To shrink the window of vulnerability, we should minimize the time between the call to hwrand and the time the function returns. One way to do this would be to make the call to hwrand the last thing that RAND_bytes does. The complication is that currently, when a large amount of data is used, RAND_bytes encrypts the output of hwrand. It would have to be changed to encrypt something else (a buffer of zeros, or state->partial_block) in that case.

DemiMarie · 2016-05-04T02:33:19Z

My thoughts:

/dev/urandom fails to block even before it has been properly seeded. Solution: poll() on /dev/random before reading anything from /dev/urandom.
RDRAND should not be trusted (possible NSA backdoor that cannot be checked except by someone who can reverse-engineer an IC).
getentropy() "does the right thing" w.r.t security, but I don't think it will solve the speed issue as the Linux kernel's algorithms (based on hashes, IIRC) are much slower than ChaCha20 (this should probably be reported as a kernel bug). On my system (i7 Haswell laptop) I can only read <14MB/s from /dev/urandom (vs ChaCha20 at ~1 cycle/byte).
pthread_atfork() isn't reliable – too many reasons to use the raw clone() syscall.

briansmith · 2016-07-03T01:46:57Z

> /dev/urandom fails to block even before it has been properly seeded. Solution: poll() on /dev/random before reading anything from /dev/urandom.

If one doesn't trust /dev/urandom then the "disable_dev_urandom_fallback" feature can be used to disable the fallback. I don't want to add extra logic like poll to support /dev/urandom beyond what we currently do.

> RDRAND should not be trusted (possible NSA backdoor that cannot be checked except by someone who can reverse-engineer an IC).

Without agreeing or disagreeing: the only way we get random numbers now is from the OS, so we're good here.

> getentropy() "does the right thing" w.r.t security, but I don't think it will solve the speed issue as the Linux kernel's algorithms (based on hashes, IIRC) are much slower than ChaCha20 (this should probably be reported as a kernel bug). On my system (i7 Haswell laptop) I can only read <14MB/s from /dev/urandom (vs ChaCha20 at ~1 cycle/byte).

This is being fixed now in Linux kernels after 4.7 (I don't remember the exact version). It is unclear that we have any need for a faster RNG anyway. I would like to see use cases before we worry about it.

> pthread_atfork() isn't reliable – too many reasons to use the raw clone() syscall.

Agreed.

Anyway, I believe everything actionable here was already done and/or has its own issue. Please file new issues if something has been overlooked.

briansmith · 2016-07-03T01:47:09Z

Also, thanks for the comments!

briansmith added enhancement static-analysis-and-type-safety performance labels Dec 30, 2015

briansmith mentioned this issue Feb 13, 2016

Remove all threading/locking code #109

Closed

4 tasks

briansmith mentioned this issue Apr 25, 2016

Use RDRAND intrinsics, enabling RDRAND on 32-bit x86 and Windows. #137

Closed

briansmith added the good-first-bug label Apr 25, 2016

briansmith mentioned this issue Apr 29, 2016

Improve address-space duplication logic in RAND_bytes #147

Closed

briansmith mentioned this issue Apr 29, 2016

Enable RDRAND-based PRNG on x86 (32-bit) #146

Closed

DemiMarie mentioned this issue May 4, 2016

Use the getrandom system call on Linux when it is available in CRYPTO_sysrand #148

Closed

briansmith closed this as completed Jul 3, 2016
