CI Failure (rpk cluster maintenance disable timed out) in PartitionBalancerTest.test_maintenance_mode #10848

Closed
dlex opened this issue May 18, 2023 · 7 comments · Fixed by #13587

Comments

dlex commented May 18, 2023

on (amd64, container) in job https://buildkite.com/redpanda/redpanda/builds/29202#01882328-a2b0-4c44-935f-ec0b02cd5046

Module: rptest.tests.partition_balancer_test
Class:  PartitionBalancerTest
Method: test_maintenance_mode
Arguments:
{
  "kill_same_node": false
}
test_id:    rptest.tests.partition_balancer_test.PartitionBalancerTest.test_maintenance_mode.kill_same_node=False
status:     FAIL
run time:   1 minute 17.140 seconds


    RpkException('command /var/lib/buildkite-agent/builds/buildkite-amd64-xfs-builders-i-09b5b635b195e82c5-1/redpanda/redpanda/vbuild/redpanda_installs/ci/bin/rpk --api-urls docker-rp-18:9644,docker-rp-12:9644,docker-rp-13:9644,docker-rp-14:9644,docker-rp-11:9644 cluster maintenance disable 4 timed out')
Traceback (most recent call last):
  File "/root/tests/rptest/clients/rpk.py", line 847, in _execute
    output, error = p.communicate(input=stdin, timeout=timeout)
  File "/usr/lib/python3.10/subprocess.py", line 1152, in communicate
    stdout, stderr = self._communicate(input, endtime, timeout)
  File "/usr/lib/python3.10/subprocess.py", line 2004, in _communicate
    self._check_timeout(endtime, orig_timeout, stdout, stderr)
  File "/usr/lib/python3.10/subprocess.py", line 1196, in _check_timeout
    raise TimeoutExpired(
subprocess.TimeoutExpired: Command '['/var/lib/buildkite-agent/builds/buildkite-amd64-xfs-builders-i-09b5b635b195e82c5-1/redpanda/redpanda/vbuild/redpanda_installs/ci/bin/rpk', '--api-urls', 'docker-rp-18:9644,docker-rp-12:9644,docker-rp-13:9644,docker-rp-14:9644,docker-rp-11:9644', 'cluster', 'maintenance', 'disable', '4']' timed out after 30 seconds

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/usr/local/lib/python3.10/dist-packages/ducktape/mark/_mark.py", line 481, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/root/tests/rptest/services/cluster.py", line 49, in wrapped
    r = f(self, *args, **kwargs)
  File "/root/tests/rptest/tests/partition_balancer_test.py", line 799, in test_maintenance_mode
    rpk.cluster_maintenance_disable(node)
  File "/root/tests/rptest/clients/rpk.py", line 890, in cluster_maintenance_disable
    return self._execute(cmd)
  File "/root/tests/rptest/clients/rpk.py", line 850, in _execute
    raise RpkException(f"command {' '.join(cmd)} timed out")
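
The two chained tracebacks above follow from a common pattern: a `subprocess` timeout is caught and a domain-specific exception is raised inside the `except` block, which is why Python prints "During handling of the above exception, another exception occurred". A minimal sketch of that pattern is below. The `execute` helper and this `RpkException` class are hypothetical stand-ins written for illustration, not the actual `rptest/clients/rpk.py` code, and the 30-second default is taken only from the timeout reported in the log.

```python
import subprocess
import sys


class RpkException(Exception):
    """Illustrative stand-in for the exception type named in the traceback."""


def execute(cmd, timeout=30):
    """Run cmd and return its stdout, raising RpkException on timeout or failure.

    Hypothetical helper: an assumption about the general shape of such a
    wrapper, not the rptest implementation.
    """
    p = subprocess.Popen(cmd,
                         stdout=subprocess.PIPE,
                         stderr=subprocess.PIPE,
                         text=True)
    try:
        output, error = p.communicate(timeout=timeout)
    except subprocess.TimeoutExpired:
        # communicate() does not kill the child on timeout, so stop it and
        # reap it before reporting the failure.
        p.kill()
        p.communicate()
        # Raising inside the except block chains the new exception to the
        # TimeoutExpired one, producing the two-part traceback.
        raise RpkException(f"command {' '.join(cmd)} timed out")
    if p.returncode != 0:
        raise RpkException(
            f"command {' '.join(cmd)} returned {p.returncode}: {error}")
    return output


if __name__ == "__main__":
    # A child that sleeps longer than the timeout stands in for a hung command.
    try:
        execute([sys.executable, "-c", "import time; time.sleep(5)"],
                timeout=0.5)
    except RpkException as e:
        print(e)
```

With a wrapper of this shape, the outer `RpkException` only says that the command timed out; the reason the command hung has to come from the broker logs of the linked build.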
@VladLazar

FAIL test: PartitionBalancerTest.test_maintenance_mode.kill_same_node=False (1/67 runs)
  failure at 2023-05-22T14:42:14.910Z: RpkException('command /var/lib/buildkite-agent/builds/buildkite-bk-amd64-xfs-builders-i-0b99a34fc5d3377e4-1/redpanda/redpanda/vbuild/redpanda_installs/ci/bin/rpk --api-urls docker-rp-11:9644,docker-rp-12:9644,docker-rp-13:9644,docker-rp-14:9644,docker-rp-15:9644 cluster maintenance disable 3 timed out')
      on (amd64, container) in job https://buildkite.com/redpanda/redpanda/builds/29583#018843b9-86de-484f-aad1-807a682ce987

@NyaliaLui

andijcr commented Jun 5, 2023

andijcr commented Jun 9, 2023

@michael-redpanda

abhijat commented Jun 23, 2023

dotnwat commented Jun 27, 2023
