Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

weird multithreading performance #15

Open
ThomasWaldmann opened this issue Jun 15, 2024 · 1 comment
Open

weird multithreading performance #15

ThomasWaldmann opened this issue Jun 15, 2024 · 1 comment

Comments

@ThomasWaldmann
Copy link

I did some practical experiment there and got weird results:

borgbackup/borg#8217 (comment)

@Rogdham
Copy link
Owner

Rogdham commented Jun 16, 2024

Hello, I gave a look to your script, and the following caught my eyes:

  1. The data you generate is not only random in content, but also random in size. I feel like the outputs would be more reliable if you generate the data only once, save it into a global variable, and use the very same data in each case.
  2. The data to be compressed is very small: between 0.5 and 4MiB.
  3. The jobSize is very small as well: 512kiB

Have you tried running the same test with bigger data sizes?

Another thing that way be worth investigating is running the zstd command directly to see if the figures you have are inherent to Zstandard or are more specific to the pyzstd library.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants