Reduce flakiness of CI test runs #4653

mbargull · 2022-11-25T11:17:29Z

Description

I started to include unrelated test fixes in #4650 to easier inspect failing/passing of relevant tests.
Those started to grow and it's cleaner to have them separated, anyway, so here we go.

What is the main change/result? (See details/motivation below.)

I went back an forth between having a fixture to add config=testing_config either completely implicit (autouse=True) or have it explicitly added to each and every test to make its use obvious.

Now I went the middle ground with pytestmark = pytest.mark.usefixtures("api_default_testing_config") to explicitly request the fixture for a test module but not on the single test level. If one wants to deactivate it for a single test but still use it for all other tests in a module, they can @pytest.mark.no_api_default_testing_config such test.

What is the overall motivation and why this back and forth, then middle ground, anyway?

The main cause of the tests' flakiness _{(apart from CRAN connection and occasional pip %TEMP% permission errors on Windows)} is that the parallel tests ought to write to separate locations but in reality most don't.
The mechanism to have them use separate locations is the testing_config fixture which sets Config.croot to a temporary directory.
So nearly all tests should explicitly run function(..., config=testing_config) for all functions that take a config parameter.
But actually, many don't, so they can occasionally run into concurrent write conflicts (see #4653 (comment) ).

Hence, the obvious fix is to add config=testing_config to all those tests.
That results in a large change set with somewhat verbose code, but is also not future-proof (since I expect it'll be forgotten to be added for newer tests again in the future).
The other solution is to have config=testing_config always unconditionally, implicitly used.
The downside with that is that it adds more hidden/non-obvious behavior.

The proposed change is to not go full explicit- but also not full-implicit mode by adding a fixture that can be

explicitly added to single tests via @pytest.mark.usefixtures("api_default_testing_config"),
implicitly added to all of a module's tests via pytestmark = pytest.mark.usefixtures("api_default_testing_config"),
explicitly deactivated for a single test via @pytest.mark.no_api_default_testing_config.

Checklist - did you ...

Add a file to the news directory (using the template) for the next release's release notes?
Add / update necessary tests?
Add / update outdated documentation?

mbargull · 2022-11-26T18:21:57Z

Comments copied from gh-4652:

I didn't add a news entry because this is just chore that doesn't touch anything outside the test code. If you'd still like to have a news entry, let me know.

Re: config=testing_config: Previously, tests failed because (even when only rendering) concurrent writes/deletions around the {croot}/work happened at conda_build.config.Config.compute_build_id (at the end in its if old_dir != work_dir: branch) happened. I didn't dig into it, but it might just be that another test creates the work (without a timestamp) folder and then compute_build_id wants to move it. In any case, testing_config sets the croot to testing_workdir and as such the single tests use their own root dir without seeing others' work folders.

mbargull · 2022-11-26T19:06:31Z

(Added description/motivation to the OP.)

jezdez

Thank you for working on this @mbargull!

I don't think your approach with rewriting the function defaults for conda_build.api is a good pattern and could be achieved by a more mundane monkey patching of the conda_build.config.get_or_merge_config function, to be automatically returning the appropriate test config.

While I enjoy seeing the use of pytestmark, I agree that it'll likely be forgotten by future contributors when adding new tests, creating rather unpleasant implicit differences in behavior again, increasing the risk of concurrency issues.

I would suggest to indeed use autouse=True to automatically enable the test config, if you think the majority of test modules need it (as it seems) and instead try to find a way to disable it when needed.

tests/conftest.py

mbargull · 2022-11-29T10:04:46Z

I don't think your approach with rewriting the function defaults for conda_build.api is a good pattern and could be achieved by a more mundane monkey patching of the conda_build.config.get_or_merge_config function, to be automatically returning the appropriate test config.

Haha, and that's what I get to only look at level n but not level n+1 :) -- I didn't even see/look for that all those functions use get_or_merge_config.
So, yes, very much agreed, I'll change it to monkeypatch get_or_merge_config; much cleaner.

I would suggest to indeed use autouse=True to automatically enable the test config, if you think the majority of test modules need it (as it seems) and instead try to find a way to disable it when needed.

Okay, I guess autouse=True is fine since we can opt out via the added @pytest.mark.no_<...>.
Mind you, this would mean that we create a temporary directory for every single test then (through testing_config -> testing_workdir -> tmpdir fixture requests).
We then probably want to set pytestmark = pytest.mark.no_default_testing_config for some test modules like test_api_consistency.py.

jezdez · 2022-11-29T10:21:56Z

I don't think your approach with rewriting the function defaults for conda_build.api is a good pattern and could be achieved by a more mundane monkey patching of the conda_build.config.get_or_merge_config function, to be automatically returning the appropriate test config.

Haha, and that's what I get to only look at level n but not level n+1 :) -- I didn't even see/look for that all those functions use get_or_merge_config. So, yes, very much agreed, I'll change it to monkeypatch get_or_merge_config; much cleaner.

Oops, conda-build keeps on giving 🤣

I would suggest to indeed use autouse=True to automatically enable the test config, if you think the majority of test modules need it (as it seems) and instead try to find a way to disable it when needed.

Okay, I guess autouse=True is fine since we can opt out via the added @pytest.mark.no_<...>. Mind you, this would mean that we create a temporary directory for every single test then (through testing_config -> testing_workdir -> tmpdir fixture requests). We then probably want to set pytestmark = pytest.mark.no_default_testing_config for some test modules like test_api_consistency.py.

Hmm, yeah, let's see what the impact on runtime is for that

jezdez · 2022-11-29T13:46:38Z

Seems like it's only one test failing now! test_skeleton_pypi_arguments_work

msumastro has been updated a couple of days ago and versions >=1.2 now store the metadata in setup.cfg which is not handled by skeletons.pypi. It is easy to fix this if we reuse the code from _load_setup_py_data but we might as well use/advertize newer projects like grayskull instead.

mbargull · 2022-11-29T15:35:13Z

Seems like it's only one test failing now! test_skeleton_pypi_arguments_work

Ha! That was a sneaky PyPI update since Saturday 😁 .
I fixed the test to use the same package version as before. The the commit message from c6463d9 for details.

mbargull · 2022-11-29T15:38:56Z

This is now a much smaller and cleaner change set with which I'm content :).
The test run time is not noticeably affected since we have rather heavy test cases that dominate all other running times.

@jezdez, thanks for your thorough and thoughtful review, I appreciate it!
Let me know if there's anything else you'd want me to change.

mbargull · 2022-11-29T17:09:24Z

Remaining failure is the permission error on Windows during pip install for peppercorn again.

kenodegard

Reran failed test, seems to be passing now

concerns have been resolved

mbargull · 2022-12-05T15:01:37Z

Looks like

@pytest.fixture(scope="function", autouse=True)
def default_testing_config(testing_config, monkeypatch, request):

doesn't work (reliably/at all?): #4665 (comment)

The default_testing_config monkeypatching fixture was added in condagh-4653 but did not consider "from .config import get_or_merge_config" cases in which get_or_merge_config is already bound and thus not patched. Signed-off-by: Marcel Bargull <marcel.bargull@udo.edu>

* Fix testing config monkeypatch for concurrent test flakiness The default_testing_config monkeypatching fixture was added in gh-4653 but did not consider "from .config import get_or_merge_config" cases in which get_or_merge_config is already bound and thus not patched. * main_build: Construct config via get_or_merge_config Helps tests with default_testing_config monkeypatch. * Tests: Don't preset values for get_or_merge_config Otherwise default values set for testing_config before, e.g., environment variable-dependent config can be set in the tests. Example: tests/cli/test_main_build.py::test_conda_py_no_period failed since it sets CONDA_PY=36 but testing_config already had set it before. --------- Signed-off-by: Marcel Bargull <marcel.bargull@udo.edu> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

test: Remove another PY2 bit; So long, pal...

6dd7ab4

conda-bot added the cla-signed [bot] added once the contributor has signed the CLA label Nov 25, 2022

mbargull force-pushed the monkeypatch-api-testing_config branch 2 times, most recently from 5d89c95 to 65abf81 Compare November 25, 2022 20:48

mbargull added 2 commits November 25, 2022 22:51

test: Mark flaky skeleton tests, retry with delay

64c0fff

test: Fix typo in testing_config fixture

e4843c8

mbargull force-pushed the monkeypatch-api-testing_config branch 3 times, most recently from a565200 to 330a777 Compare November 26, 2022 13:29

mbargull added 2 commits November 26, 2022 16:03

test: Monkeypatch api to default to testing_config

b055905

test: use api_default_testing_config fixture

8f6bfb4

mbargull force-pushed the monkeypatch-api-testing_config branch from 330a777 to 8f6bfb4 Compare November 26, 2022 15:13

mbargull mentioned this pull request Nov 26, 2022

Reduce flakiness of CI test runs #4652

Closed

3 tasks

mbargull marked this pull request as ready for review November 26, 2022 19:06

mbargull mentioned this pull request Nov 26, 2022

fix: conda-build CLI overrode condarc's zstd_compression_level with the default value #4650

Merged

3 tasks

jezdez previously requested changes Nov 29, 2022

View reviewed changes

tests/conftest.py Outdated Show resolved Hide resolved

mbargull force-pushed the monkeypatch-api-testing_config branch from 2ba8bad to efe302f Compare November 29, 2022 10:19

mbargull force-pushed the monkeypatch-api-testing_config branch 2 times, most recently from 25293a5 to a29a4be Compare November 29, 2022 10:33

test: monkeypatch get_or_merge_config instead api

0f4becf

mbargull force-pushed the monkeypatch-api-testing_config branch from a29a4be to 0f4becf Compare November 29, 2022 10:45

jezdez added this to the 3.23.2 milestone Nov 29, 2022

kenodegard mentioned this pull request Nov 29, 2022

Pin msumastro to 1.1.6 #4657

Closed

3 tasks

kenodegard approved these changes Nov 29, 2022

View reviewed changes

kenodegard merged commit 1053650 into conda:main Nov 29, 2022

jaimergp mentioned this pull request Jun 28, 2023

Testing bug: fixture testing_config 's boolify returns True regardless the value #4435

Closed

mbargull mentioned this pull request Nov 10, 2023

Fix testing config monkeypatch for concurrent test flakiness #5068

Merged

3 tasks

github-actions bot added the locked [bot] locked due to inactivity label Dec 6, 2023

github-actions bot locked as resolved and limited conversation to collaborators Dec 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce flakiness of CI test runs #4653

Reduce flakiness of CI test runs #4653

mbargull commented Nov 25, 2022 •

edited

Loading

mbargull commented Nov 26, 2022

mbargull commented Nov 26, 2022

jezdez left a comment

mbargull commented Nov 29, 2022 •

edited

Loading

jezdez commented Nov 29, 2022

jezdez commented Nov 29, 2022

mbargull commented Nov 29, 2022

mbargull commented Nov 29, 2022

mbargull commented Nov 29, 2022

kenodegard left a comment

mbargull commented Dec 5, 2022

Reduce flakiness of CI test runs #4653

Reduce flakiness of CI test runs #4653

Conversation

mbargull commented Nov 25, 2022 • edited Loading

Description

What is the main change/result? (See details/motivation below.)

What is the overall motivation and why this back and forth, then middle ground, anyway?

Checklist - did you ...

mbargull commented Nov 26, 2022

mbargull commented Nov 26, 2022

jezdez left a comment

Choose a reason for hiding this comment

mbargull commented Nov 29, 2022 • edited Loading

jezdez commented Nov 29, 2022

jezdez commented Nov 29, 2022

mbargull commented Nov 29, 2022

mbargull commented Nov 29, 2022

mbargull commented Nov 29, 2022

kenodegard left a comment

Choose a reason for hiding this comment

mbargull commented Dec 5, 2022

mbargull commented Nov 25, 2022 •

edited

Loading

mbargull commented Nov 29, 2022 •

edited

Loading