integration test for stopping target service of proxy feature broken #772

engelmi · 2024-03-04T10:49:52Z

Describe the bug

It seems that the proxy-service-stop-service feature is broken in the GH CI:
https://github.com/eclipse-bluechi/bluechi/actions/runs/8138365284/job/22240289714?pr=770#step:8:175

Running tests locally in the contianer setup doesn't lead to a failure.

It receives a timeout while waiting for the requesting.service to start (which resolves the proxy dependency).

Note:
Its currently a bit hard to debug since no journal logs and other artifacts are collected due to the pytest timeout (which can't be caught and its not intended to do so, apparently). Therefore, a small and simple custom implementation of a signal-based timeout might be better in the future.

To Reproduce

Running integration tests in the CI

Expected behavior

Test passes

The text was updated successfully, but these errors were encountered:

Relates to: eclipse-bluechi#772 When pytest-timeout triggers a timeout is triggered, the test will be exited with pytest.fail and test artifacts might not be collected - so debugging issues with timeouts become hard to debug. Therefore, replace pytest-timeout with a slim, custom implementation that throws an error when timeout is reached. In addition, lets distinguish between timeouts for setup, test execution and artifact collection. Signed-off-by: Michael Engel <mengel@redhat.com>

engelmi · 2024-03-04T11:33:52Z

It seems there are logs about too many open files:

11:18:34                 out: 2024-03-04 11:18:34+0000,097 DEBUG   [bluechi_test.test] Stopping all BlueChi components in all container... (test:99)                                                                                                          
11:18:34                 out: 2024-03-04 11:18:34+0000,148 DEBUG   [bluechi_test.client] Executed command 'systemctl stop bluechi-agent' with result '0' and output 'b''' (client:84)                                                                         
11:18:34                 out: 2024-03-04 11:18:34+0000,222 DEBUG   [bluechi_test.client] Executed command 'systemctl show --property="Result" bluechi-agent' with result '0' and output 'Result=success' (client:84)                                          
11:18:34                 out: 2024-03-04 11:18:34+0000,293 DEBUG   [bluechi_test.client] Executed command 'systemctl stop bluechi-agent' with result '0' and output 'Failed to allocate directory watch: Too many open files' (client:84)                     
11:18:34                 out: 2024-03-04 11:18:34+0000,368 DEBUG   [bluechi_test.client] Executed command 'systemctl show --property="Result" bluechi-agent' with result '0' and output 'Result=success' (client:84)

So this might be related to podman and the GH hosts in the CI.

Update:
It seems that if this issue occurs, it'll persist quite a while but eventually disappears after restarting the pipeline.

Relates to: eclipse-bluechi#772 When pytest-timeout triggers a timeout is triggered, the test will be exited with pytest.fail and test artifacts might not be collected - so debugging issues with timeouts become hard to debug. Therefore, replace pytest-timeout with a slim, custom implementation that throws an error when timeout is reached. In addition, lets distinguish between timeouts for setup, test execution and artifact collection. Signed-off-by: Michael Engel <mengel@redhat.com>

Relates to: #772 When pytest-timeout triggers a timeout is triggered, the test will be exited with pytest.fail and test artifacts might not be collected - so debugging issues with timeouts become hard to debug. Therefore, replace pytest-timeout with a slim, custom implementation that throws an error when timeout is reached. In addition, lets distinguish between timeouts for setup, test execution and artifact collection. Signed-off-by: Michael Engel <mengel@redhat.com>

mwperina · 2024-03-12T12:52:56Z

Should be fixed by #820 feel free to reopen if appears again

engelmi added the bug Something isn't working label Mar 4, 2024

engelmi mentioned this issue Mar 4, 2024

replaced pytest-timeout with custom timeout #773

Merged

This was referenced Mar 4, 2024

added code of conduct #771

Merged

added security.md #770

Merged

engelmi added the testing This issue adds or improves the testing label Mar 6, 2024

engelmi mentioned this issue Mar 7, 2024

added bluechi-proxy to help option test #809

Merged

mwperina mentioned this issue Mar 12, 2024

Increase inotify limits for integration tests in GH CI #820

Merged

mwperina closed this as completed Mar 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

integration test for stopping target service of proxy feature broken #772

integration test for stopping target service of proxy feature broken #772

engelmi commented Mar 4, 2024

engelmi commented Mar 4, 2024 •

edited

Loading

mwperina commented Mar 12, 2024

integration test for stopping target service of proxy feature broken #772

integration test for stopping target service of proxy feature broken #772

Comments

engelmi commented Mar 4, 2024

Describe the bug

To Reproduce

Expected behavior

engelmi commented Mar 4, 2024 • edited Loading

mwperina commented Mar 12, 2024

engelmi commented Mar 4, 2024 •

edited

Loading