Block splitter : minor reformatting #3376

Cyan4973 · 2022-12-18T19:23:37Z

After reading #3204, I wanted to have a look at the block splitter to understand what's going on.
This exercise led to a (rather large) number of minor changes, which were initially cosmetic only but eventually included a few correctness updates. They don't change the behavior of the functionality, but hopefully improve maintainability.

List of changes:

Shortened many long lines
Conversion from offset to offbase was done manually; it now employs the dedicated macros
const correctness: seqStore was passed as a mutable buffer in scenarios where it is read-only. The problem ran multiple levels deep, so several functions were checked and their prototypes updated to reflect this property.
Several variables and pointers could be made const, which makes it clearer what happens to them. Some variables could be removed; others could be moved to a more restrictive scope
Several fixes for the -Wconversion warning level (note: not activated by default on the zstd code base)
Some assert() statements were added to better document the expected state of the system.
Added a few code comments
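
To illustrate the kind of change listed above, here is a minimal sketch of const correctness propagated through a call chain, combined with an explicit narrowing cast for -Wconversion and assert() used as documentation. The type `SeqStoreSketch` and both functions are hypothetical stand-ins invented for this example; they are not zstd's actual types or prototypes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for a sequence store (NOT zstd's type). */
typedef struct {
    const uint16_t* litLengths;   /* one literal length per sequence */
    size_t nbSeqs;                /* number of entries in litLengths */
} SeqStoreSketch;

/* Lowest level: only reads the store, so it takes a pointer to const.
 * This prototype has to be const before any caller can be made const. */
static size_t sumLitLengths(const SeqStoreSketch* store, size_t begin, size_t end)
{
    size_t total = 0;
    size_t n;
    assert(store != NULL);
    assert(begin <= end && end <= store->nbSeqs);   /* documents the expected range */
    for (n = begin; n < end; n++) total += store->litLengths[n];
    return total;
}

/* One level up: it can only declare 'store' as const because every
 * function it forwards the pointer to accepts a pointer to const. */
static uint32_t averageLitLength(const SeqStoreSketch* store)
{
    size_t const total = sumLitLengths(store, 0, store->nbSeqs);
    if (store->nbSeqs == 0) return 0;
    /* Explicit cast: an implicit size_t -> uint32_t narrowing is the kind
     * of conversion that -Wconversion reports. The average cannot exceed
     * the largest uint16_t value, so the cast is lossless here. */
    return (uint32_t)(total / store->nbSeqs);
}
```

With a store holding the literal lengths {1, 2, 3, 6}, `sumLitLengths(&store, 0, 4)` returns 12 and `averageLitLength(&store)` returns 3.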

Only minor stuff, but I figure it was still valuable enough for maintenance to be merged.
None of these changes impact the behavior of the program. The source code is stricter, but it still does the same thing.

A side effect of this exercise is that I have a better understanding of the block splitter now.

The estimation logic, which judges if a splitting decision is beneficial or not, looks fine and seems generally re-usable.

The splitting decision though is still pretty raw, blindly mechanical.
Hence, that's where improvements could be inserted.
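
As a rough illustration of the separation between estimation and decision, the sketch below pairs a toy cost estimator with a purely mechanical rule: always try the midpoint, and keep the split only if the two halves are estimated to cost less than the whole. Everything here (`estimateCost`, `deriveSplits`, `SplitsSketch`, the cost formula, the cap of 16 splits) is an assumption made for this example and not a transcription of zstd's code.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_SPLITS_SKETCH 16   /* hypothetical cap on recorded split points */

typedef struct {
    size_t idx[MAX_SPLITS_SKETCH];   /* split positions, in increasing order */
    size_t count;
} SplitsSketch;

/* Toy estimator (an assumption, not a real entropy model): a fixed
 * 3-unit header plus a payload that grows with the number of distinct
 * symbol values present in [begin, end). */
static size_t estimateCost(const unsigned char* syms, size_t begin, size_t end)
{
    unsigned char seen[256] = {0};
    size_t distinct = 0;
    size_t n;
    assert(begin <= end);
    for (n = begin; n < end; n++) {
        if (!seen[syms[n]]) { seen[syms[n]] = 1; distinct++; }
    }
    return 3 + distinct * (end - begin);
}

/* Mechanical decision: only the midpoint is ever considered.
 * If splitting there is estimated to be cheaper, recurse into both halves. */
static void deriveSplits(SplitsSketch* splits, const unsigned char* syms,
                         size_t begin, size_t end)
{
    size_t const mid = begin + (end - begin) / 2;
    assert(begin <= end);
    if (end - begin < 2 || splits->count >= MAX_SPLITS_SKETCH) return;
    {   size_t const whole  = estimateCost(syms, begin, end);
        size_t const first  = estimateCost(syms, begin, mid);
        size_t const second = estimateCost(syms, mid, end);
        if (first + second < whole) {
            deriveSplits(splits, syms, begin, mid);
            if (splits->count < MAX_SPLITS_SKETCH)
                splits->idx[splits->count++] = mid;
            deriveSplits(splits, syms, mid, end);
        }
    }
}
```

In this sketch the estimator can be swapped out without touching the decision rule, and the decision rule never looks for a better boundary than the midpoint, which is the sort of place where a smarter (and probably slower) strategy could be plugged in.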

That being said, adding complexity at this stage is also likely going to impact speed.
So it might become desirable to feature multiple "levels" of block splitting.

terrelln · 2022-12-19T20:03:55Z

The splitting decision though is still pretty raw, blindly mechanical.
Hence, that's where improvements could be inserted.

Agreed, we definitely need better splitting logic, which can hopefully be (mostly) compression-algorithm agnostic.

facebook-github-bot added the CLA Signed label Dec 18, 2022

minor reformatting

832c1a6

and minor reliability and maintenance changes

Cyan4973 force-pushed the split2 branch from 9aef1f5 to 832c1a6 Compare December 18, 2022 19:27

terrelln approved these changes Dec 19, 2022

Cyan4973 merged commit 9073fe0 into dev Dec 19, 2022

Cyan4973 deleted the split2 branch January 13, 2023 04:27
