Zstd multithreaded output can depend on number of threads #2327

terrelln · 2020-09-25T02:30:03Z

Describe the bug
As reported by @animalize in Issue #2238:

When using ZSTD_e_end end directive and output buffer size >= ZSTD_compressBound() the job number is calculated by ZSTDMT_computeNbJobs() function. This function produces a different number of jobs depending on nbWorkers:

zstd/lib/compress/zstdmt_compress.c

Lines 1243 to 1255 in b706286

    
           static unsigned 
        
           ZSTDMT_computeNbJobs(const ZSTD_CCtx_params* params, size_t srcSize, unsigned nbWorkers) 
        
           { 
        
               assert(nbWorkers>0); 
        
               {   size_t const jobSizeTarget = (size_t)1 << ZSTDMT_computeTargetJobLog(params); 
        
                   size_t const jobMaxSize = jobSizeTarget << 2; 
        
                   size_t const passSizeMax = jobMaxSize * nbWorkers; 
        
                   unsigned const multiplier = (unsigned)(srcSize / passSizeMax) + 1; 
        
                   unsigned const nbJobsLarge = multiplier * nbWorkers; 
        
                   unsigned const nbJobsMax = (unsigned)(srcSize / jobSizeTarget) + 1; 
        
                   unsigned const nbJobsSmall = MIN(nbJobsMax, nbWorkers); 
        
                   return (multiplier>1) ? nbJobsLarge : nbJobsSmall; 
        
           }   }

Expected behavior
The output of zstd multithreaded compression must be independent of the number of threads.

Fix

Make ZSTDMT_computeNbJobs() independent of nbWorkers.
Add a fuzz test that checks that the output of multithreaded zstd is always independent of the number of threads.

Workaround
If you need to work around this bug, don't start your streaming job with ZSTD_e_end. Pass at least one byte of input with ZSTD_e_continue before calling ZSTD_e_end, or ensure your output buffer is < ZSTD_compressBound(inputSize).

The text was updated successfully, but these errors were encountered:

Cyan4973 · 2020-09-25T02:43:41Z

It's a shortcut to say that the outcome of multithreaded zstd does not depend on nb of threads.

Actually, the feature supported is that the outcome of streaming multithreaded zstd does not depend on nb of threads
(and that's what is used by the zstd CLI).

This definition makes it possible to consider another potential fix :
do not employ the one-pass shortcut for ZSTD_e_end when nbWorkers >= 1,
since it's the delegation to the one-pass mode which triggers this issue.

This could be less disruptive than trying to adapt the single-pass MT compressor,
which was never designed to offer this guarantee.

Another (potentially positive) side effect is that it would guarantee that streaming multithreaded compression is always non-blocking, since it would no longer delegate to the (blocking) single-pass mode.
edit : scrap that, no longer delegating to the single-pass mode doesn't guarantee non-blocking, since on receiving ZSTD_e_flush and ZSTD_e_end directive, the MT API contract changes from minimal forward progress to maximal progress.

ghost · 2020-09-25T02:51:56Z

Another (potentially positive) side effect is that it would guarantee that streaming multithreaded compression is always non-blocking, since it would no longer delegate to the blocking mode.

I once wanted to propose adding a ZSTD_compressStream3() function, that is always blocking in multithreaded compression.

If the caller keeps checking the non-blocking progress, ~~it's very inconvenient~~.

edit: Just found, checking the progress is not very inconvenient:

do {
    zstd_ret = ZSTD_compressStream2(self->cctx, &out, &in, ZSTD_e_continue);
} while (out.pos != out.size && in.pos != in.size && !ZSTD_isError(zstd_ret));

But it's better to have an always blocking ZSTD_compressStream3(), it may be faster a bit, IMO many programmer users don't need to get the compression progress.

terrelln · 2020-09-25T18:00:44Z

This could be less disruptive than trying to adapt the single-pass MT compressor,
which was never designed to offer this guarantee.

Yeah, that is probably easier. I had forgotten that all the jobs in the single pass MT compressor needed to be launched at once.

I once wanted to propose adding a ZSTD_compressStream3() function, that is always blocking in multithreaded compression.

Generally, the way people write streaming compression loops, it shouldn't be terribly inconvenient to not make maximal forward progress. If we were to add something like this, it wouldn't require a new API. We'd probably just need to add a compression parameter to control it. But, I don't currently see a great need for it.

Simplifies the code and removes blocking from zstdmt. At this point we could completely delete `ZSTDMT_compress_advanced_internal()`. However I'm leaving it in because I think we want to do that in the zstd-1.5.0 release, in case anyone is still using the ZSTDMT API, even though it is not installed by default. Fixes facebook#2327.

terrelln added the bug label Sep 25, 2020

terrelln self-assigned this Sep 25, 2020

terrelln mentioned this issue Sep 25, 2020

Stability of parallel compression #2238

Closed

terrelln mentioned this issue Oct 2, 2020

Fix zstdmt stability issues and clean up the zstdmt code #2339

Merged

terrelln closed this as completed in 1784c4b Oct 28, 2020

ghost mentioned this issue Dec 15, 2020

Updated CHANGELOG for v1.4.7 #2426

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zstd multithreaded output can depend on number of threads #2327

Zstd multithreaded output can depend on number of threads #2327

terrelln commented Sep 25, 2020 •

edited

Loading

Cyan4973 commented Sep 25, 2020 •

edited

Loading

ghost commented Sep 25, 2020 •

edited by ghost

Loading

terrelln commented Sep 25, 2020

Zstd multithreaded output can depend on number of threads #2327

Zstd multithreaded output can depend on number of threads #2327

Comments

terrelln commented Sep 25, 2020 • edited Loading

Cyan4973 commented Sep 25, 2020 • edited Loading

ghost commented Sep 25, 2020 • edited by ghost Loading

terrelln commented Sep 25, 2020

terrelln commented Sep 25, 2020 •

edited

Loading

Cyan4973 commented Sep 25, 2020 •

edited

Loading

ghost commented Sep 25, 2020 •

edited by ghost

Loading