Stream the parallel xz/gz tarball generation #76

cuviper · 2018-01-17T01:32:40Z

This melds the serial-Tee and parallel-batched approaches from before
and after commit adea17e. Now we can get the same multithreaded speedup
without having to build the entire uncompressed tarball in memory first.

The new impl Write for RayonTee uses rayon::join to split the
compression work for each buffer to separate threads. This is scoped,
so it can be fully zero-copy, sharing the input buffer directly. This
is all wrapped in a 1 MiB BufWriter to balance the cost of thread
wake-ups and synchronization.

The net performance is unchanged, using around 125% CPU -- approximately
4:1 time spent in xz versus gz. The overall memory use is much reduced,
now independent of the tarball size -- just a few MiB on top of the
fixed-cost 674 MiB compressor memory requirements of xz -9.

Fixes #75.

This melds the serial-`Tee` and parallel-batched approaches from before and after commit adea17e. Now we can get the same multithreaded speedup without having to build the entire uncompressed tarball in memory first. The new `impl Write for RayonTee` uses `rayon::join` to split the compression work for each buffer to separate threads. This is scoped, so it can be fully zero-copy, sharing the input buffer directly. This is all wrapped in a 1 MiB `BufWriter` to balance the cost of thread wake-ups and synchronization. The net performance is unchanged, using around 125% CPU -- approximately 4:1 time spent in xz versus gz. The overall memory use is much reduced, now independent of the tarball size -- just a few MiB on top of the fixed-cost 674 MiB compressor memory requirements of `xz -9`.

alexcrichton · 2018-01-17T01:58:03Z

Awesome, thanks @cuviper!

Pull in rust-lang/rust-installer#76 to get streamed tarball generation, rather than batching it all in memory, while still getting the benefit of compressing in parallel.

…k-Simulacrum Update rust-installer for streaming parallelism Pull in rust-lang/rust-installer#76 to get streamed tarball generation, rather than batching it all in memory, while still getting the benefit of compressing in parallel.

alexcrichton merged commit b55e0fc into rust-lang:master Jan 17, 2018

cuviper mentioned this pull request Jan 17, 2018

Update rust-installer for streaming parallelism rust-lang/rust#47509

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stream the parallel xz/gz tarball generation #76

Stream the parallel xz/gz tarball generation #76

cuviper commented Jan 17, 2018

alexcrichton commented Jan 17, 2018

Stream the parallel xz/gz tarball generation #76

Stream the parallel xz/gz tarball generation #76

Conversation

cuviper commented Jan 17, 2018

alexcrichton commented Jan 17, 2018