
question: Slowdown after upgrading from 1.0.26 to 1.0.28 #395

Closed
ivpavici opened this issue Jan 10, 2024 · 8 comments
Comments

@ivpavici

ivpavici commented Jan 10, 2024

Hello!

This is more of a question, so apologies in advance if details are missing; I'm not sure which ones would help!

We're seeing a slowdown (more than 2x) while running tests in our project locally after bumping flate2 from 1.0.26 to 1.0.28. Everything else in the project stays the same!

Cargo.toml: flate2 = { version = "1.0.28" }

Cargo.lock:

[[package]]
name = "flate2"
version = "1.0.28"
source = "registry+https://github.com/rust-lang/crates.io-index"
checksum = "46303f565772937ffe1d394a4fac6f411c6013172fadde9dcdb1e147a086940e"
dependencies = [
 "crc32fast",
 "miniz_oxide",
]

The problem is that when we run the same tests on GitHub Actions, there is no slowdown between these versions!

I'm running the tests on Ubuntu under WSL2, and I can reproduce this behavior every time I switch between the two versions.

More information needed

@Byron
Member

Byron commented Jan 10, 2024

This configuration means that miniz_oxide is used, a pure-Rust implementation. The miniz_oxide dependency hasn't changed in a while, so the cause should be either something in flate2 or a difference in the Rust compiler (unless you say that wasn't upgraded either).

In any case, I recommend trying 1.0.27, and once the first good and the first bad tags are found, I'd use a local clone of flate2 and set it up to be used with

[patch."crates-io"]
flate2 = { path = "/path/to/clone" }

Then git bisect can be used in the flate2 clone to find the commit that introduced the issue. With zlib-ng I didn't notice any issue on my side, but miniz-oxide is certainly different enough.

As this issue can't be reproduced here, I recommend closing it while considering the submission of a PR for a fix that works for you.

@ivpavici
Author

ivpavici commented Jan 11, 2024

thank you @Byron !

We have identified why there is a slowdown locally but not on GitHub Actions: locally I ran the project with plain cargo run, and that doesn't seem to affect version 1.0.26.

On GitHub, however, the package is built with the --release flag (and runs inside Docker). That's where the difference shows for 1.0.28: it is 2 to 3 times slower under plain cargo run than under cargo run --release.

Locally, running both versions with --release results in the same execution time:

cargo run --release with 1.0.26
Time: 268.679 s, estimated 944 s

cargo run --release with 1.0.28
Time: 267.24 s, estimated 269 s

@Byron
Member

Byron commented Jan 11, 2024

Thanks for digging in and for solving the puzzle :). I am glad it isn't anything more serious.

@Byron Byron closed this as completed Jan 11, 2024
@jongiddy
Contributor

A good candidate for the cause of the slowdown is #373. That PR explicitly prioritized correctness over performance, so it is reassuring that the times in release mode show no change.

kkovaacs added a commit to eqlabs/pathfinder that referenced this issue Feb 27, 2024
flate2 1.0.28 built unoptimized is significantly slower than it was
before an unsoundness in unsafe code was fixed. This unnecessarily
slows down some of our tests, which use `flate2` during test setup.

For more info please check rust-lang/flate2-rs#395

This change makes sure that `flate2` is optimized even for a `dev` build.
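The workaround that commit describes can be expressed with Cargo's per-package profile overrides in the consuming crate's Cargo.toml. A sketch (opt-level 3 is an assumption; any nonzero level may be enough):

```toml
# Compile flate2 and its backend with optimizations even in dev/test
# builds, while the rest of the workspace stays unoptimized.
[profile.dev.package.flate2]
opt-level = 3

[profile.dev.package.miniz_oxide]
opt-level = 3
```

This keeps incremental rebuilds of your own code fast while avoiding the unoptimized-compression penalty during tests.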
@Hexta

Hexta commented Mar 2, 2024

Hey,
Unfortunately, #373 caused issues in release mode as well.
One related issue is vectordotdev/vector#19981.

@Byron
Member

Byron commented Mar 3, 2024

Thanks for bringing this to my attention!

I took a look at the flamegraph in vectordotdev/vector#19981 and noticed that the runtime is dominated by calls to an unaligned memset function. This is probably caused by the resize(cap, 0) call, which fills the output vec with zeroes. This didn't happen before the change in #373. However, it also means that the output vector isn't being reused as much as it maybe should be, or that its length is repeatedly dropped back below capacity, so zeroes are written again and again.

Something I couldn't make sense of is this claim in the vector issue:

After version v0.34.0, the memory-allocation system call count is very high when using GzEncoder (I also confirmed that this call stack occurs when using zlib compression rather than gzip)

The change in #373 definitely does not allocate (or deallocate). The only real change is that it can fill unused capacity with zeroes the first time.

Thus I think it's more likely an issue in how vector uses output vectors during compression, one that has now become more prominent. This comment seems to indicate something similar as well.

I'd wait and see how vector ends up solving the issue, in order to figure out if anything can be done here to prevent issues of the same kind in future.

@Byron
Member

Byron commented Mar 4, 2024

The current implementation indeed suffers from a performance degradation when exposed to many small writes during compression.

The problem is that the internal buffer is created with 32 KiB of capacity. However, with each small write it's memset to its full capacity, only to be truncated right after to what was actually written.

These calls accumulate into something very costly. The typical solution is to avoid many small writes by buffering them, so in a way, leaving this as-is seems like a net positive: it can reveal usage issues while being easy to fix in the caller's code.

CHr15F0x pushed a commit to eqlabs/pathfinder that referenced this issue Mar 5, 2024
flate2 1.0.28 built unoptimized is significantly slower than it was
before an unsoundness in unsafe code was fixed. This unnecessarily
slows down some of our tests, which use `flate2` during test setup.

For more info please check rust-lang/flate2-rs#395

This change makes sure that `flate2` is optimized even for a `dev` build.
@Shatur

Shatur commented May 20, 2024

I also noticed a performance regression when updating from 1.0.27 to 1.0.28.
In my case release mode doesn't help, and it's ~100 times slower...
