
Question - Help with setting the correct options for dgraph-io/badger #196

Closed
jarifibrahim opened this issue Dec 25, 2019 · 6 comments

@jarifibrahim
Contributor

Hey @klauspost, thank you for writing this amazing library in Go. I work on https://github.com/dgraph-io/badger and we'd like to use this library instead of the CGO-based ZSTD implementation.
We chatted about this a while ago: https://discuss.dgraph.io/t/badger-compression-feedback/5478

Here's what we're compressing in badger:
Badger stores key-value pairs in tables called SSTs. Each SST is divided into blocks of 4KB by default. We'd like to compress these blocks.

Compression of the blocks is a one-time thing but decompression happens every time a new block is accessed (this is a frequent operation).

I understand that https://github.com/klauspost/compress/tree/master/zstd#blocks describes how to compress small blocks, but is 4KB considered a small block?

We'd like to have a fair tradeoff between the decompression speed and compression ratio.

I see that there are a number of options for encoding and decoding, but with my limited knowledge of how ZSTD works, I can't figure out which ones should be tweaked.
I'd really appreciate it if you could help me pick the appropriate options for encoding/decoding :)

@klauspost
Owner

klauspost commented Dec 25, 2019

is 4KB considered a small block?

Yes, definitely; I would even say very small. The frame header (6-16 bytes) and the CRC will be a rather significant part of the size. If you already have something that deals with bit rot, disabling the CRC may be worthwhile here.

Test whether WithSingleSegment() is a benefit to you: it will make blocks a tiny bit bigger, but possibly a bit faster to decode. WithWindowSize is set automatically, so the default should be fine. Since blocks are so small, WithEncoderLevel(zstd.SpeedFastest) is probably the way to go, since 'default' will bring little benefit and has a bigger startup cost.

We'd like to have a fair tradeoff between the decompression speed and compression ratio.

I think the fastest mode will give you that. WithNoEntropyCompression will speed up both encoding and decoding, but it will compress considerably worse.

If your storage backend has no problem storing zero-length blobs, keep WithZeroFrames off (the default). This means 0 bytes of input produce 0 bytes of output; otherwise a useless frame header, and possibly a CRC, is added.

The defaults with the 'fastest' level set should be fine; use EncodeAll/DecodeAll. Remember that if you provide an existing slice for output, it should have length zero but the capacity for the 4KB of output. WithSingleSegment(false) will probably save you 2 bytes per block.

Oh, and when you benchmark, be sure to use real blocks of data, and test a variety of different ones.

@jarifibrahim
Contributor Author

Hey @klauspost, I tried running some benchmarks on two kinds of data.

  1. Table data (contains some randomly generated data).

Compression ratios:
Snappy                 1.753
LZ4                    1.786
Datadog ZSTD level 1   3.199
Datadog ZSTD level 3   3.100
Go ZSTD level 1        3.217
Go ZSTD level 3        3.147
name                                        time/op
Comp/Compression/Snappy-16                   4.09µs ± 1%
Comp/Compression/LZ4-16                      5.06µs ± 1%
Comp/Compression/ZSTD_-_Datadog-level1-16    17.6µs ± 3%
Comp/Compression/ZSTD_-_Datadog-level3-16    20.7µs ± 3%
Comp/Compression/ZSTD_-_Go_-_level1-16       27.8µs ± 2%
Comp/Compression/ZSTD_-_Go_-_Default-16      39.1µs ± 1%
Comp/Decompression/Snappy-16                 1.13µs ± 1%
Comp/Decompression/LZ4-16                     642ns ± 1%
Comp/Decompression/ZSTD_-_Datadog-16         7.12µs ± 2%
Comp/Decompression/ZSTD_-_Go-16              13.7µs ± 2%

name                                       speed
Comp/Compression/Snappy-16                 1.00GB/s ± 1%
Comp/Compression/LZ4-16                     806MB/s ± 1%
Comp/Compression/ZSTD_-_Datadog-level1-16   231MB/s ± 3%
Comp/Compression/ZSTD_-_Datadog-level3-16   197MB/s ± 3%
Comp/Compression/ZSTD_-_Go_-_level1-16      147MB/s ± 2%
Comp/Compression/ZSTD_-_Go_-_Default-16     104MB/s ± 1%
Comp/Decompression/Snappy-16               3.60GB/s ± 1%
Comp/Decompression/LZ4-16                  6.34GB/s ± 1%
Comp/Decompression/ZSTD_-_Datadog-16        573MB/s ± 2%
Comp/Decompression/ZSTD_-_Go-16             298MB/s ± 2%
  2. 4KB of text taken from https://gist.github.com/StevenClontz/4445774

Compression ratios:
Snappy                 1.305
LZ4                    1.171
ZSTD level 1           1.929
ZSTD level 3           1.932
Go ZSTD level 1        1.895
Go ZSTD level 3        1.928
name                                       time/op
Comp/Compression/Snappy-16                   6.88µs ± 2%
Comp/Compression/LZ4-16                      5.87µs ± 1%
Comp/Compression/ZSTD_-_Datadog-level1-16    22.7µs ± 4%
Comp/Compression/ZSTD_-_Datadog-level3-16    29.6µs ± 4%
Comp/Compression/ZSTD_-_Go_-_level1-16       35.7µs ± 1%
Comp/Compression/ZSTD_-_Go_-_Default-16      97.9µs ± 1%
Comp/Decompression/Snappy-16                 1.53µs ± 2%
Comp/Decompression/LZ4-16                     623ns ± 1%
Comp/Decompression/ZSTD_-_Datadog-16         8.36µs ± 0%
Comp/Decompression/ZSTD_-_Go-16              16.0µs ± 0%

name                                       speed
Comp/Compression/Snappy-16                  597MB/s ± 2%
Comp/Compression/LZ4-16                     699MB/s ± 1%
Comp/Compression/ZSTD_-_Datadog-level1-16   181MB/s ± 4%
Comp/Compression/ZSTD_-_Datadog-level3-16   139MB/s ± 4%
Comp/Compression/ZSTD_-_Go_-_level1-16      115MB/s ± 1%
Comp/Compression/ZSTD_-_Go_-_Default-16    41.9MB/s ± 1%
Comp/Decompression/Snappy-16               2.69GB/s ± 2%
Comp/Decompression/LZ4-16                  6.58GB/s ± 0%
Comp/Decompression/ZSTD_-_Datadog-16        489MB/s ± 2%
Comp/Decompression/ZSTD_-_Go-16             256MB/s ± 0%

Here's the script I used: https://gist.github.com/jarifibrahim/91920e93d1ecac3006b269e0c05d6a24

I have a couple of questions:

  1. Why does the compression ratio worsen when I use actual text instead of random data?
  2. I see a considerable speed difference between the Go implementation and the CGO-based ZSTD. Is this expected?

@klauspost
Owner

  1. You need a wider variety of inputs to really judge that. Different data has different characteristics, and with a single input you are not getting the full picture. With a really small data set you are also training the CPU's branch predictor for that specific input. Using different types of data will give a more realistic picture, which will probably show some losses, some wins, etc.

  2. The C implementation has had many, many hours poured into it, so it is pretty much as good as things can get, while Go has a natural disadvantage from a less advanced compiler and certain forced checks and zeroing. That said, I have not focused much on very small blocks yet, so there are likely still some gains to be had.

But your benchmark code looks solid, so apart from perhaps testing more different types of blocks, it should give a fair picture.

@klauspost
Owner

Added some experimental code for small blocks: #199

Not a huge improvement, but worth taking.

@klauspost
Owner

Found a much bigger improvement. Now about 15% faster on the fastest setting.

@jarifibrahim
Contributor Author

Found a much bigger improvement. Now about 15% faster on the fastest setting.

This is amazing, @klauspost. I'll benchmark the new code.

Thank you so much for helping out with this :)
