Improve stability of hostnames test #1016

madolson · 2024-09-11T05:33:52Z

Maybe partially resolves #952.

The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state.

I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration.

Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

codecov · 2024-09-11T05:47:41Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 70.62%. Comparing base (9f0c801) to head (308357d).
Report is 6 commits behind head on unstable.

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable    #1016      +/-   ##
============================================
+ Coverage     70.60%   70.62%   +0.02%     
============================================
  Files           114      114              
  Lines         61651    61658       +7     
============================================
+ Hits          43526    43544      +18     
+ Misses        18125    18114      -11

see 15 files with indirect coverage changes

enjoy-binbin

nice!

zuiderkwast

Yeah, pause process seems more robust. Good idea!

tests/unit/cluster/hostnames.tcl

Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

tests/unit/cluster/hostnames.tcl

Fix a typo Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

Maybe partially resolves valkey-io#952. The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state. I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

Maybe partially resolves valkey-io#952. The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state. I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>

Maybe partially resolves valkey-io#952. The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state. I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

Maybe partially resolves valkey-io#952. The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state. I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>

Maybe partially resolves #952. The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state. I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com> Signed-off-by: Ping Xie <pingxie@google.com>

Maybe partially resolves valkey-io#952. The hostnames test relies on an assumption that node zero and node six don't communicate with each other to test a bunch of behavior in the handshake stake. This was done by previously dropping all meet packets, however it seems like there was some case where node zero was sending a single pong message to node 6, which was partially initializing the state. I couldn't track down why this happened, but I adjusted the test to simply pause node zero which also correctly emulates the state we want to be in since we're just testing state on node 6, and removes the chance of errant messages. The test was failing about 5% of the time locally, and I wasn't able to reproduce a failure with this new configuration. --------- Signed-off-by: Madelyn Olson <madelyneolson@gmail.com> Signed-off-by: naglera <anagler123@gmail.com>

Improve stability of hostnames test

49ed444

Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

madolson requested review from zuiderkwast and enjoy-binbin September 11, 2024 05:34

madolson added the test-failure An issue indicating a test failure label Sep 11, 2024

enjoy-binbin approved these changes Sep 11, 2024

View reviewed changes

zuiderkwast reviewed Sep 11, 2024

View reviewed changes

tests/unit/cluster/hostnames.tcl Outdated Show resolved Hide resolved

Address comment

e4582bc

Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

madolson requested a review from zuiderkwast September 11, 2024 15:38

madolson added 2 commits September 11, 2024 08:39

Use inclusive language

ada8366

Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

Change variable name

8f779ec

Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

madolson commented Sep 11, 2024

View reviewed changes

tests/unit/cluster/hostnames.tcl Outdated Show resolved Hide resolved

Fix a typo

308357d

Fix a typo Signed-off-by: Madelyn Olson <madelyneolson@gmail.com>

zuiderkwast approved these changes Sep 11, 2024

View reviewed changes

madolson merged commit 2b207ee into valkey-io:unstable Sep 11, 2024
46 checks passed

enjoy-binbin mentioned this pull request Sep 12, 2024

Fix replica unable trigger migration when it received CLUSTER SETSLOT in advance #981

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve stability of hostnames test #1016

Improve stability of hostnames test #1016

madolson commented Sep 11, 2024

codecov bot commented Sep 11, 2024 •

edited

Loading

enjoy-binbin left a comment

zuiderkwast left a comment

Improve stability of hostnames test #1016

Improve stability of hostnames test #1016

Conversation

madolson commented Sep 11, 2024

codecov bot commented Sep 11, 2024 • edited Loading

Codecov Report

enjoy-binbin left a comment

Choose a reason for hiding this comment

zuiderkwast left a comment

Choose a reason for hiding this comment

codecov bot commented Sep 11, 2024 •

edited

Loading