Add tests for single threaded CLI 4GB @ all strategies to trigger index reduction #2601

senhuang42 · 2021-05-03T17:43:26Z

#2598 fixes a bug with rowhash that gets large enough to trigger index reduction.

Here, we add some CLI tests that would have trivially caught this (with ASAN). We add a ST 4GB roundtrip test for each strategy, so that future modifications to other strategies will also get tested with huge files.

We also add a ubsan-asan gh actions test.

Test Plan:

verifies that this fails on dev, passes with Fix chaintable check to include rowhash in ZSTD_reduceIndex() #2598

Cyan4973 · 2021-05-03T18:56:27Z

While this adds some much needed test coverage,
it also seems to weight down CI tests in important ways.

Longest test detected in this run :

generic-dev / test (pull_request) Successful in 51m

which I presume is related to large files single-threaded tests across all strategies.
In contrast, without this new set of tests, the same test seems be shorter. For example, in #2598 :

generic-dev / test (pull_request) Successful in 24m

and in #2597:

generic-dev / test (pull_request) Successful in 24m

So that's not a one-off, and it's far more than 3 minutes.
On top of that, it's making one of our longest tests even longer, thus worsening feedback signal delay.
I don't know what's the time-out limit of Github Actions, but it's getting dangerously close.

Finally, the new test asan-ubsan-testzstd fails, which I presume is expected,
but the important point here is that this premature exit prevents us from measuring the final time spent into the new section large files single-threaded tests across all strategies, since it bails out as soon as the error is triggered.
I would expect this test to be far slower than just make test, without uasan, which we just mentioned doubled in test time spent to reach something close to an hour.

This is a question of balance.
We need better test coverage, but not at the cost of everything else,
and not at the cost of significantly worse CI feedback loop.

Try to find ways to make the feedback loop less impacted.
One possibility here would be to separate the test, so that it's not part of make test, which is run a lot of time already.
Possibly, split the test across VM, so that each one of them remain <= ~20mn limit if possible.

senhuang42 · 2021-05-03T19:02:24Z

Possibly, split the test across VM, so that each one of them remain <= ~20mn limit if possible.

Ah interesting. I'm guessing we can just add more gh actions that basically just each perform the action:
datagen -g4000M -P99 | zstd -v --zstd=strategy=1 --single-thread | zstd -d --zstd=strategy=1 --single-thread, rather than adding this to make test. Though that does mean that the test only exists in CI, and not in local tests.

Edit: still looks like it takes too long

terrelln · 2021-05-03T21:51:33Z

We might be able to abandon this in favor of #2603.

senhuang42 · 2021-05-03T21:54:40Z

We might be able to abandon this in favor of #2603.

Agreed, this would be redundant.

facebook-github-bot added the CLA Signed label May 3, 2021

senhuang42 mentioned this pull request May 3, 2021

Fix chaintable check to include rowhash in ZSTD_reduceIndex() #2598

Merged

senhuang42 force-pushed the rowhash_cli_test branch from aa75518 to f5d481d Compare May 3, 2021 18:55

senhuang42 force-pushed the rowhash_cli_test branch 2 times, most recently from 5245b36 to 102074d Compare May 3, 2021 21:39

Add GH Actions large file test

102074d

senhuang42 closed this May 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tests for single threaded CLI 4GB @ all strategies to trigger index reduction #2601

Add tests for single threaded CLI 4GB @ all strategies to trigger index reduction #2601

senhuang42 commented May 3, 2021 •

edited

Loading

Cyan4973 commented May 3, 2021 •

edited

Loading

senhuang42 commented May 3, 2021 •

edited

Loading

terrelln commented May 3, 2021

senhuang42 commented May 3, 2021

Add tests for single threaded CLI 4GB @ all strategies to trigger index reduction #2601

Add tests for single threaded CLI 4GB @ all strategies to trigger index reduction #2601

Conversation

senhuang42 commented May 3, 2021 • edited Loading

Cyan4973 commented May 3, 2021 • edited Loading

senhuang42 commented May 3, 2021 • edited Loading

terrelln commented May 3, 2021

senhuang42 commented May 3, 2021

senhuang42 commented May 3, 2021 •

edited

Loading

Cyan4973 commented May 3, 2021 •

edited

Loading

senhuang42 commented May 3, 2021 •

edited

Loading