Remove duplicate and random algorithms from tests #9626

PrivatePuffin · 2019-11-26T14:46:12Z

This is a spinoff from #8941 seperating general, unrelated, test changes from the zstd PR.

TL:DR:

Random tests (send-c_verify_ratio) lead to inpredicatble results
Duplicate algorithms lead to duplicate execution of tests
Combined the effects are even worse and lead to false positives

Signed-off-by: Kjeld Schouten-Lebbing kjeld@schouten-lebbing.nl

Motivation and Context

During my work on investigating the compression test suite for #8941 a few things became appearant:
send-c_verify_ratio used randomly selected compression algorithms. Those algorithms differ widely in performance and for the majority consist of different levels of gzip. This lead to difficulty comparing performance of tests over multiple runs. This also lead to situations where tests ended with a false-positive (pass) result, due to algorithms being skipped.

During my research into this there seemed to be 2 versions of multi-compression test suits:

Randomly selected from a pool listing all different levels of algorithms
Fixed selective non-random selection of algorithms

The 2 tests using version 1, both suffered from the same problems. (listed above) This wouldn't be that bad, but it gets worse when new algorithms get added with multiple levels.

Description

This change removes duplicate algorithms from all compression tests (and leaves just 1 of each kind), including the random compression pool.

It also removed the random draw of algorithms from send-c_verify_ratio, to make sure results are repeatable. This also removes one whole loop of the test and thus improves performance.

Is * taken into account?

The following is taken into account:

There is a seperate test for testing aliasses, those should work regardless
If many more algorithms get added, possibly in the future in some places random selection might be needed, but still should only list one level each to prevent limiting the chances of drawing a non-level based algorithm like lz4
If seperate levels work totally differently, a seperate test for that algorithm would be a cleaner solution than adding all levels everywhere.

How Has This Been Tested?

i've been debating and testing multiple versions/levels of these changes for about a week now. After I came up with them due to (unrelated) issues in #8941

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Performance enhancement (non-breaking change which improves efficiency)
Code cleanup (non-breaking change which makes code smaller or more readable)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (a change to man pages or other documentation)

Checklist:

My code follows the ZFS on Linux code style requirements.
I have updated the documentation accordingly.
I have read the contributing document.
I have added tests to cover my changes.
[] All new and existing tests passed.
All commit messages are properly formatted and contain Signed-off-by.

- Random tests (send-c_verify_ratio) lead to inpredicatble results - Duplicate algorithms lead to duplicate execution of tests - Combined the effects are even worse and lead to false positives Signed-off-by: Kjeld Schouten-Lebbing <kjeld@schouten-lebbing.nl>

codecov · 2019-11-26T23:26:44Z

Codecov Report

Merging #9626 into master will decrease coverage by 0.19%.
The diff coverage is n/a.

@@            Coverage Diff            @@
##           master    #9626     +/-   ##
=========================================
- Coverage   79.37%   79.18%   -0.2%     
=========================================
  Files         418      418             
  Lines      123531   123531             
=========================================
- Hits        98057    97817    -240     
- Misses      25474    25714    +240

Flag	Coverage Δ
#kernel	`79.9% <ø> (-0.02%)`	⬇️
#user	`66.79% <ø> (-0.51%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 0c46813...0823d84. Read the comment docs.

PrivatePuffin · 2019-11-27T07:10:01Z

*note
Small thank-you to @c0d3z3r0 for mentioning general changes should be kept out of zstd when possible. If I complain about those of others, my own should be seperated as well :)

*additional note:
Test failures are unrelated, known issues

PrivatePuffin · 2019-11-27T18:58:01Z

@behlendorf You might want a restart on those failed tests (unrelated).
But I don't think its worth it, it has absolutely nothing to do with the PR and would've been caught by those tests regardless if there where any issues with this.

behlendorf

Thanks for looking in to this! Looks good, I agree we don't want to test each different compression level, and making the tests behave more consistently is a good thing.

c0d3z3r0 · 2019-11-28T07:29:53Z

What about tests/zfs-tests/include/libtest.shlib?

c0d3z3r0 · 2019-11-28T07:40:20Z

tests/zfs-tests/tests/functional/rsend/rsend_012_pos.ksh

-	rand_set_prop $vol compression "on" "off" "lzjb" "gzip" \
-		"gzip-1" "gzip-2" "gzip-3" "gzip-4" "gzip-5" "gzip-6"   \
-		"gzip-7" "gzip-8" "gzip-9"
+	rand_set_prop $vol compression "off" "lzjb" "gzip" "lz4"


add zle? (separate PR?)

ZLE with random writen data (which is often used) is basically the same as compression=off

c0d3z3r0 · 2019-11-28T07:40:38Z

Oh, I just stumbled over this... aren't we missing zle and lz4 in function get_compress_opts in tests/zfs-tests/include/libtest.shlib?

PrivatePuffin · 2019-11-28T07:41:58Z

@c0d3z3r0 I purposefully left this PR to do one thing only:
Remove duplicate algorithms and random selected algorithms where possible.

I have decided not to add the added algorithms brainslayer added to my PR.

lz4 is included in libtest.shlib as on
I've decided not to remove the gzip list there, I am not 100% certain that list should contain just 1 gzip variant.... The way brainslayer did those changes didn;t look completely right to me, throwing in GZIP_OPTS with just 1 value and adding ZSTD_OPTS with also just one value...

But i'll look into it today, Just need to go over all instances get_compress_opts is called and make sure what the consequences of these changes would be.

PrivatePuffin · 2019-11-28T08:22:43Z

@c0d3z3r0
You made me realise the fact we might need a bigger refactor of this, I'll close this for now while I work on that. We can always reopen if i'm not happy with how that works out :)

behlendorf added the Status: Code Review Needed Ready for review and testing label Nov 27, 2019

behlendorf approved these changes Nov 27, 2019

View reviewed changes

behlendorf added Status: Accepted Ready to integrate (reviewed, tested) and removed Status: Code Review Needed Ready for review and testing labels Nov 27, 2019

c0d3z3r0 reviewed Nov 28, 2019

View reviewed changes

PrivatePuffin closed this Nov 28, 2019

PrivatePuffin mentioned this pull request Nov 28, 2019

Test Compression Algorithm Selection Refactor #9645

Merged

12 tasks

c0d3z3r0 mentioned this pull request Dec 16, 2019

Introduce ZSTD compression to ZFS #9735

Closed

12 tasks

PrivatePuffin deleted the test-fix branch December 19, 2019 18:59

c0d3z3r0 mentioned this pull request May 1, 2020

Introduce ZSTD compression to ZFS #10278

Closed

17 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove duplicate and random algorithms from tests #9626

Remove duplicate and random algorithms from tests #9626

PrivatePuffin commented Nov 26, 2019

codecov bot commented Nov 26, 2019 •

edited

Loading

PrivatePuffin commented Nov 27, 2019 •

edited

Loading

PrivatePuffin commented Nov 27, 2019

behlendorf left a comment

c0d3z3r0 commented Nov 28, 2019

c0d3z3r0 Nov 28, 2019

PrivatePuffin Nov 28, 2019

c0d3z3r0 commented Nov 28, 2019 •

edited

Loading

PrivatePuffin commented Nov 28, 2019 •

edited

Loading

PrivatePuffin commented Nov 28, 2019

Remove duplicate and random algorithms from tests #9626

Remove duplicate and random algorithms from tests #9626

Conversation

PrivatePuffin commented Nov 26, 2019

Motivation and Context

Description

Is * taken into account?

How Has This Been Tested?

Types of changes

Checklist:

codecov bot commented Nov 26, 2019 • edited Loading

Codecov Report

PrivatePuffin commented Nov 27, 2019 • edited Loading

PrivatePuffin commented Nov 27, 2019

behlendorf left a comment

Choose a reason for hiding this comment

c0d3z3r0 commented Nov 28, 2019

c0d3z3r0 Nov 28, 2019

Choose a reason for hiding this comment

PrivatePuffin Nov 28, 2019

Choose a reason for hiding this comment

c0d3z3r0 commented Nov 28, 2019 • edited Loading

PrivatePuffin commented Nov 28, 2019 • edited Loading

PrivatePuffin commented Nov 28, 2019

codecov bot commented Nov 26, 2019 •

edited

Loading

PrivatePuffin commented Nov 27, 2019 •

edited

Loading

c0d3z3r0 commented Nov 28, 2019 •

edited

Loading

PrivatePuffin commented Nov 28, 2019 •

edited

Loading