fix sequence compression API in Explicit Delimiter mode #3023

Cyan4973 · 2022-01-22T19:42:02Z

In Explicit Delimiter mode, the Sequence compression API was only working when
the frame has only a single block
or when all blocks are full (except the last one).

This fix makes is possible to select block boundaries wherever the caller wants them.
It still requires that data conforms to zstd specification, aka block sizes still have a maximum.

The fuzzer capability has been extended to test the explicit delimiter mode too.

terrelln · 2022-01-25T01:14:13Z

Looks like theres an assert in the tests/fuzz/sequence_compression_api.c fuzzer that needs to be fixed.

terrelln

LGTM once the tests pass

terrelln · 2022-01-25T01:15:12Z

lib/common/error_private.c

@@ -27,7 +27,7 @@ const char* ERR_getErrorString(ERR_enum code)
    case PREFIX(version_unsupported): return "Version not supported";
    case PREFIX(frameParameter_unsupported): return "Unsupported frame parameter";
    case PREFIX(frameParameter_windowTooLarge): return "Frame requires too much memory for decoding";
-    case PREFIX(corruption_detected): return "Corrupted block detected";
+    case PREFIX(corruption_detected): return "Input corruption detected";


nit: Maybe "Data corruption detected"? It feels more natural than "Input corruption"

terrelln · 2022-01-26T00:24:27Z

lib/compress/zstd_compress.c

+                                        cctx->blockSize, remaining,
+                                        inSeqs, inSeqsSize, seqPos);
+        U32 const lastBlock = (blockSize == remaining);
+        assert(blockSize <= remaining);


Looks like this assert is failing.

Maybe it just needs to be moved below the FORWARD_IF_ERROR() check?

Yes, that's exactly the issue,
blockSize can be an error code, so this assert() is only valid after the FORWARD_IF_ERROR().

add explicit delimiter mode to libfuzzer test

Cyan4973 · 2022-01-26T17:36:57Z

The misplaced assert() was the only issue affecting the library.
The bulk of the work (and changes) was in the fuzz tester itself, in order to make it compatible with the Explicit Delimiter mode. As a nice side-effect, it allowed observing that the library was correctly rejecting all sort of bad conditions, from incorrect sequences, to invalid blocks, to incomplete frames, etc.
So I think this is finally good to go.

facebook-github-bot added the CLA Signed label Jan 22, 2022

Cyan4973 force-pushed the fix_seqCompress_withDelimiter branch 2 times, most recently from 0f2816a to 55f5946 Compare January 24, 2022 17:50

terrelln approved these changes Jan 25, 2022

View reviewed changes

fix sequence compression API in Explicit Delimiter mode

87dcd33

Cyan4973 force-pushed the fix_seqCompress_withDelimiter branch from 55f5946 to 87fb8a5 Compare January 25, 2022 21:34

terrelln reviewed Jan 26, 2022

View reviewed changes

Cyan4973 force-pushed the fix_seqCompress_withDelimiter branch 2 times, most recently from c89e7c8 to 6096d33 Compare January 26, 2022 07:33

refactored fuzzer tests for sequence compression api

fc2ea97

add explicit delimiter mode to libfuzzer test

Cyan4973 force-pushed the fix_seqCompress_withDelimiter branch from 6096d33 to fc2ea97 Compare January 26, 2022 08:29

Cyan4973 merged commit a0acf9a into dev Jan 26, 2022

Cyan4973 deleted the fix_seqCompress_withDelimiter branch January 13, 2023 04:28

Cyan4973 mentioned this pull request Feb 9, 2023

release v1.5.4 #3487

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix sequence compression API in Explicit Delimiter mode #3023

fix sequence compression API in Explicit Delimiter mode #3023

Cyan4973 commented Jan 22, 2022 •

edited

Loading

terrelln commented Jan 25, 2022

terrelln left a comment

terrelln Jan 25, 2022

terrelln Jan 26, 2022 •

edited

Loading

Cyan4973 Jan 26, 2022

Cyan4973 commented Jan 26, 2022

fix sequence compression API in Explicit Delimiter mode #3023

fix sequence compression API in Explicit Delimiter mode #3023

Conversation

Cyan4973 commented Jan 22, 2022 • edited Loading

terrelln commented Jan 25, 2022

terrelln left a comment

Choose a reason for hiding this comment

terrelln Jan 25, 2022

Choose a reason for hiding this comment

terrelln Jan 26, 2022 • edited Loading

Choose a reason for hiding this comment

Cyan4973 Jan 26, 2022

Choose a reason for hiding this comment

Cyan4973 commented Jan 26, 2022

Cyan4973 commented Jan 22, 2022 •

edited

Loading

terrelln Jan 26, 2022 •

edited

Loading