roachtest: gossip/chaos/nodes=9 failed #126545

Closed
cockroach-teamcity opened this issue Jul 2, 2024 · 1 comment
Labels
branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-roachtest O-robot Originated from a bot. T-kv KV Team X-duplicate Closed as a duplicate of another issue.

Comments

@cockroach-teamcity
Member

cockroach-teamcity commented Jul 2, 2024

roachtest.gossip/chaos/nodes=9 failed with artifacts on master @ 9e1cd533828b7887a48db8635f705669287cf2d6:

(gossip.go:81).2: gossip did not stabilize (dead node 2) in 56.4s
test artifacts and logs in: /artifacts/gossip/chaos/nodes=9/cpu_arch=arm64/run_1

Parameters:

  • ROACHTEST_arch=arm64
  • ROACHTEST_cloud=azure
  • ROACHTEST_coverageBuild=false
  • ROACHTEST_cpu=4
  • ROACHTEST_encrypted=false
  • ROACHTEST_metamorphicBuild=false
  • ROACHTEST_ssd=0
Help

See: roachtest README

See: How To Investigate (internal)

Grafana is not yet available for azure clusters

/cc @cockroachdb/kv-triage

This test on roachdash | Improve this report!

Jira issue: CRDB-39970

@cockroach-teamcity cockroach-teamcity added branch-master Failures and bugs on the master branch. C-test-failure Broken test (automatically or manually discovered). O-roachtest O-robot Originated from a bot. release-blocker Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked. T-kv KV Team labels Jul 2, 2024
@arulajmani arulajmani added X-duplicate Closed as a duplicate of another issue. and removed release-blocker Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked. labels Jul 3, 2024
@arulajmani
Collaborator

arulajmani commented Jul 3, 2024

We hypothesized that the test was sleeping longer than its intended 1s duration, and added more logging in #126087 to prove this. It seems like we are indeed sleeping for longer than 1s, but it still doesn't account for the 56s wait to print the "checking if gossip excludes dead node" log line.

I'm not sure what to make of this, but I'll close this out as a duplicate of #124828.

2024/07/02 09:43:18 gossip.go:92: 1: gossip not ok (dead node 2 present) (3s)
2024/07/02 09:43:18 gossip.go:132: sleeping for 1s (3s)
2024/07/02 09:44:11 gossip.go:126: checking if gossip excludes dead node 2 (56s)

Projects
No open projects
Status: roachtest/unit test backlog