
[CI] DedicatedClusterSnapshotRestoreIT.testMasterShutdownDuringSnapshot failure #51253

Closed

andreidan opened this issue Jan 21, 2020 · 2 comments · Fixed by #51270
Assignees: original-brownbear
Labels: :Distributed Coordination/Snapshot/Restore (Anything directly related to the `_snapshot/*` APIs), >test-failure (Triaged test failures from CI)

Comments

@andreidan (Contributor) commented:

Encountered this failure on a feature branch: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+pull-request-1/14196/

A build scan is available here: https://gradle-enterprise.elastic.co/s/y3sp3pow27tp2

andreidan added the :Distributed Coordination/Snapshot/Restore and >test-failure labels on Jan 21, 2020
@elasticmachine (Collaborator) commented:

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

andreidan changed the title from "DedicatedClusterSnapshotRestoreIT.testMasterShutdownDuringSnapshot failure" to "[CI] DedicatedClusterSnapshotRestoreIT.testMasterShutdownDuringSnapshot failure" on Jan 21, 2020
original-brownbear self-assigned this on Jan 21, 2020
@original-brownbear (Member) commented:

Looks like this may have been introduced by #50788 ... will create a fix shortly

original-brownbear added a commit to original-brownbear/elasticsearch that referenced this issue on Jan 21, 2020:

On master failover we have to resend all the shard-failed messages,
but the transport requests remain the same in the eyes of `equals`.
If the master failover is registered and the requests to the new master
are sent before all the callbacks have executed and the request to the
old master has been removed from the deduplicator, then the requests to the
new master will incorrectly fail and the snapshot will get stuck.

Closes elastic#51253
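
To make the failure mode concrete, here is a minimal sketch of deduplication keyed on request equality. The class and method names are hypothetical and the structure is deliberately simplified; this is not the actual Elasticsearch `TransportRequestDeduplicator` implementation.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical, simplified deduplicator: tracks in-flight requests by the
// request object itself, relying on its equals()/hashCode().
final class RequestDeduplicator<R> {
    private final Map<R, Boolean> inFlight = new ConcurrentHashMap<>();

    /** Returns true if the caller should actually send this request. */
    boolean shouldSend(R request) {
        // A resend that is equals() to a still-pending request is treated as
        // a duplicate and suppressed, even if the pending copy was addressed
        // to a master node that has since failed over.
        return inFlight.putIfAbsent(request, Boolean.TRUE) == null;
    }

    /** Must be called from the response/failure callback of the original send. */
    void onComplete(R request) {
        inFlight.remove(request);
    }
}
```

In the scenario described in the commit message, the shard-failed request sent to the old master is still tracked as in-flight when the failover triggers a resend. Because the resent request is `equals()` to the pending one, it is treated as a duplicate, so its outcome is tied to a request addressed to a master that will never answer, and the snapshot stalls.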
original-brownbear added a commit that referenced this issue Jan 22, 2020
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this issue Jan 22, 2020
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this issue Jan 22, 2020
original-brownbear added a commit that referenced this issue Jan 22, 2020
original-brownbear added a commit that referenced this issue Jan 22, 2020