Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wait for cluster fully online in cluster_config_consistent #272

Merged
merged 2 commits into from
Apr 9, 2024

Conversation

VoletiRam
Copy link
Contributor

@VoletiRam VoletiRam commented Apr 8, 2024

Wait for cluster to be in a fully consistent and online state in cluster_config_consistent. We expect the start_server to create the desired primaries and replicas before the start of the tests. With the current setup, the replicas may not complete the sync with primaries and can be in loading state. In some cases, the role of replicas can still be master with the delay of propagation of replicate command. The tests can show flaky behavior in such cases. Add a check that verifies the nodes health status 'online' for the cluster consistency. Leverage the deterministic order of CLUSTER SLOTS to consider the cluster as consistent along with the nodes health status.

hpatro and others added 2 commits April 8, 2024 22:07
Signed-off-by: Harkrishn Patro <harkrisp@amazon.com>
Fix `cluster_config_consistent` proc of tcl tests by verifying the
node status to be online. We expect the start_server to create the
desired primaries and replicas before the start of the tests. With
the current setup, the replicas may not complete the sync with
primaries and can be in loading state. In some cases, the
role of replicas can still be master with the delay of propagation of
replicate command. The tests can show flaky behavior in such cases. Add
a check that verifies the health status 'online' to consider the cluster
as consistent.

Signed-off-by: Ram Prasad Voleti <ramvolet@amazon.com>
tests/cluster/cluster.tcl Show resolved Hide resolved
@hpatro
Copy link
Contributor

hpatro commented Apr 8, 2024

@zuiderkwast Here's the fix + tcl test coverage for #265

@hpatro hpatro requested a review from zuiderkwast April 8, 2024 22:57
@madolson madolson changed the title Fix cluster_config_consistent proc of tcl tests Wait for cluster fully online in cluster_config_consistent Apr 9, 2024
@madolson madolson added the polish typos, style, etc label Apr 9, 2024
@madolson madolson merged commit d89ef06 into valkey-io:unstable Apr 9, 2024
14 checks passed
PatrickJS pushed a commit to PatrickJS/placeholderkv that referenced this pull request Apr 24, 2024
…#272)

Wait for cluster to be in a fully consistent and online state in
`cluster_config_consistent`. We expect the `start_server` to create the
desired primaries and replicas before the start of the tests. With the
current setup, the replicas may not complete the sync with primaries and
can be in loading state. In some cases, the role of replicas can still
be master with the delay of propagation of replicate command. The tests
can show flaky behavior in such cases. Add a check that verifies the
nodes health status 'online' for the cluster consistency. Leverage the
deterministic order of `CLUSTER SLOTS` to consider the cluster as
consistent along with the nodes health status.

---------

Signed-off-by: Harkrishn Patro <harkrisp@amazon.com>
Signed-off-by: Ram Prasad Voleti <ramvolet@amazon.com>
Co-authored-by: Harkrishn Patro <harkrisp@amazon.com>
Co-authored-by: Ram Prasad Voleti <ramvolet@amazon.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
polish typos, style, etc
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants