CI Failure (Failed to hydrate chunk start) in CloudStorageTimingStressTest.test_cloud_storage_with_partition_moves #17834

Closed
r-vasquez opened this issue Apr 12, 2024 · 3 comments
Labels
area/storage ci-failure ci-ignore Automatic ci analysis tools ignore this issue kind/bug Something isn't working

Comments

@r-vasquez
Contributor

r-vasquez commented Apr 12, 2024

https://buildkite.com/redpanda/redpanda/builds/47714#018ed105-f852-4954-aa02-df7ff3663dbb/6-7535

Module: rptest.tests.cloud_storage_timing_stress_test
Class:  CloudStorageTimingStressTest
Method: test_cloud_storage_with_partition_moves
Arguments:
{
  "cleanup_policy": "compact,delete"
}
test_id:    rptest.tests.cloud_storage_timing_stress_test.CloudStorageTimingStressTest.test_cloud_storage_with_partition_moves.cleanup_policy=compact.delete
status:     FAIL
run time:   3 minutes 16.089 seconds


    <BadLogLines nodes=docker-rp-11(2) example="ERROR 2024-04-12 07:02:42,210 [shard 1:fetc] cloud_storage - [fiber32 3935427d/kafka/test-topic/0_28/970-1103-4211020-1-v1.log.1] - segment_chunk_api.cc:226 - Failed to hydrate chunk start 1052902, error: Sleep is aborted">
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 184, in _do_run
    data = self.run_test()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 276, in run_test
    return self.test_context.function(self.test)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/mark/_mark.py", line 535, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/root/tests/rptest/services/cluster.py", line 177, in wrapped
    redpanda.raise_on_bad_logs(
  File "/root/tests/rptest/services/redpanda.py", line 1523, in raise_on_bad_logs
    lsearcher.search_logs(_searchable_nodes)
  File "/root/tests/rptest/services/utils.py", line 156, in search_logs
    raise BadLogLines(bad_loglines)
rptest.services.utils.BadLogLines: <BadLogLines nodes=docker-rp-11(2) example="ERROR 2024-04-12 07:02:42,210 [shard 1:fetc] cloud_storage - [fiber32 3935427d/kafka/test-topic/0_28/970-1103-4211020-1-v1.log.1] - segment_chunk_api.cc:226 - Failed to hydrate chunk start 1052902, error: Sleep is aborted">

JIRA Link: CORE-2346

@r-vasquez r-vasquez added kind/bug Something isn't working ci-failure labels Apr 12, 2024
@rockwotj rockwotj added ci-ignore Automatic ci analysis tools ignore this issue and removed ci-ignore Automatic ci analysis tools ignore this issue labels Apr 12, 2024
@nvartolomei nvartolomei marked this as a duplicate of #16585 Apr 22, 2024
@nvartolomei nvartolomei marked this as not a duplicate of #16585 Apr 22, 2024
@nvartolomei
Contributor

The cause of the error is leadership loss, which leads to remote partition shutdown, which in turn leads to HTTP client shutdown. It doesn't seem too problematic, just bad error handling.
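
To illustrate what "bad error handling" could mean here, below is a minimal Python sketch of the general pattern: an abort caused by an orderly shutdown is logged at debug level rather than as an ERROR, so a log scanner would not flag it. This is an assumption-laden illustration, not the actual change in #17828 and not Redpanda code; `ShutdownAborted`, `hydrate_chunk`, and `hydrate_or_skip` are hypothetical names.

```python
import logging

log = logging.getLogger("hydration_sketch")


class ShutdownAborted(Exception):
    """Hypothetical stand-in for an abort raised during an orderly shutdown."""


def hydrate_chunk(start, shutting_down):
    # Hypothetical download step: aborts if the owning component is stopping.
    if shutting_down:
        raise ShutdownAborted("Sleep is aborted")
    return f"chunk@{start}"


def hydrate_or_skip(start, shutting_down):
    """Return the chunk, or None if hydration was cancelled by shutdown."""
    try:
        return hydrate_chunk(start, shutting_down)
    except ShutdownAborted as exc:
        # Expected during shutdown (e.g. after leadership loss): log quietly
        # instead of emitting an ERROR line.
        log.debug("Hydration of chunk start %d cancelled by shutdown: %s",
                  start, exc)
        return None
```

In this sketch any other exception type still propagates to the caller, so genuine hydration failures would remain visible.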

@nvartolomei
Contributor

Fixed by #17828
