ShardFollowNodeTask should fetch operation once #32455

dnhatn · 2018-07-30T01:56:23Z

Today ShardFollowNodeTask might fetch some operations more than once.
This happens because, for the left-over request, we ask the leader for
up to max_batch_count operations instead of the left-over size. The
leader can then freely respond with up to max_batch_count operations.
If one of the previous requests completes in the meantime, we might
issue another read request whose range overlaps with the response to
the left-over request.

Closes #32453

elasticmachine · 2018-07-30T01:56:25Z

Pinging @elastic/es-distributed

bleskes · 2018-07-30T06:27:41Z

I probably don't understand something, and I want to clarify. The idea of asking for more than the global checkpoint was that when we do that, we know it's the last request we're going to send out and it's also the only one requesting that range, so we might as well ask for more. Until the request comes back with a new global checkpoint, we won't ask for anything else, and when it does come back, new requests will take the already fetched operations into account.

The log line in the bug report said:

[ShardFollowNodeTask] fetch from=1620, to=1679, receive [1620 2024] (*)

but I can't find that pattern in code, so I can't clarify what it means exactly.

What am I missing?

PS I'm not saying we shouldn't make the change you're suggesting but I want to understand it better. Regardless, it should be ok to fetch things twice in rare cases.

dnhatn · 2018-07-30T15:46:49Z

@bleskes

> [ShardFollowNodeTask] fetch from=1620, to=1679, receive [1620 2024] (*)
> but I can't find that pattern in code, so I can't clarify what it means exactly.

Sorry for the confusion. I added this log locally for debugging.

> The idea of asking for more than the global checkpoint was that when we do that, we know it's the last request we're going to send out and it's also the only one requesting that range, so we might as well ask for more. Until the request comes back with a new global checkpoint, we won't ask for anything else, and when it does come back, new requests will take the already fetched operations into account.

If the request is the only ongoing request, we should ask for max_batch_count. However, here I am fixing the left-over request, not the peak request (MVG's term).

Suppose the leader_global_checkpoint (fetched by the task) is 2018, last_request_seqno is 2, max_batch_count is 1000. The current code will send concurrently three requests:

( from=2, batch_count=1000, max_required=1001 ),
( from=1002, batch_count=1000, max_required=2001 ),
( from=2002, batch_count=1000, max_required=2018 )

The last_request_seqno is 2018 after issuing these three requests.

Suppose the global checkpoint on the leader has advanced to 2999. When the first request completes, we will issue another read request (from=2019, batch_count=1000, max_required=2999). This request will receive operations from 2019 to 2999. The problem is that the third request (from=2002, batch_count=1000, max_required=2018) will also receive operations from 2002 to 2999, so we fetch 2019 to 2999 twice.
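The overlap in this example can be checked with a small, self-contained Java sketch. The `Read` class and the `duplicated` helper are hypothetical names made up for illustration and are not part of ShardFollowNodeTask. The sketch assumes the leader returns every operation from `from` up to its own global checkpoint, limited only by the requested batch count.

```java
class OverlapSketch {
    // Hypothetical value type for one read request (not an Elasticsearch class).
    static final class Read {
        final long from;
        final int batchCount;

        Read(long from, int batchCount) {
            this.from = from;
            this.batchCount = batchCount;
        }

        // Assumption: the leader returns operations up to its own global
        // checkpoint, limited only by the requested batch count.
        long lastReturned(long leaderCheckpoint) {
            return Math.min(leaderCheckpoint, from + batchCount - 1);
        }
    }

    // Number of operations returned by both reads, where `later` starts after `earlier`.
    static long duplicated(Read earlier, Read later, long leaderCheckpoint) {
        long overlapEnd = Math.min(earlier.lastReturned(leaderCheckpoint),
                                   later.lastReturned(leaderCheckpoint));
        return Math.max(0, overlapEnd - later.from + 1);
    }

    public static void main(String[] args) {
        long leaderCheckpoint = 2999;            // leader has advanced past 2018
        Read next = new Read(2019, 1000);        // issued after the first request completes
        Read leftOver = new Read(2002, 1000);    // third request, asking for max_batch_count
        Read capped = new Read(2002, 17);        // third request, asking for 2002..2018 only

        System.out.println(duplicated(leftOver, next, leaderCheckpoint)); // 981
        System.out.println(duplicated(capped, next, leaderCheckpoint));   // 0
    }
}
```

With a batch count of 1000 the left-over request returns 2019 to 2999 a second time (981 operations). With a batch count equal to the left-over size of 17, the two responses no longer overlap.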

> PS I'm not saying we shouldn't make the change you're suggesting but I want to understand it better. Regardless, it should be ok to fetch things twice in rare cases.

Yep, I agree we should not enforce this all the time, but we should avoid it in obvious cases.

bleskes

LGTM. Thanks for clarifying.

bleskes · 2018-07-30T16:27:45Z

x-pack/plugin/ccr/src/main/java/org/elasticsearch/xpack/ccr/action/ShardFollowNodeTask.java

+            final long maxRequiredSeqNo = Math.min(leaderGlobalCheckpoint, from + maxBatchOperationCount - 1);
+            final int requestBatchCount;
+            if (numConcurrentReads == 0) {
+                // If this is the only request, we can treat it as a peek read.


add "and let it optimistically fetch more documents if possible (but not require it)"?
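The diff excerpt above is cut off after the first branch. Below is a minimal sketch of how the two branches could fit together, assuming the non-peek branch asks only for the operations up to `maxRequiredSeqNo`. The `else` branch and the wrapper method `requestBatchCount` are assumptions drawn from the discussion, not the merged code.

```java
class BatchCountSketch {
    // Hypothetical helper: picks the batch count for the next read request.
    static int requestBatchCount(long from, long leaderGlobalCheckpoint,
                                 int maxBatchOperationCount, int numConcurrentReads) {
        final long maxRequiredSeqNo =
            Math.min(leaderGlobalCheckpoint, from + maxBatchOperationCount - 1);
        if (numConcurrentReads == 0) {
            // Only request in flight: treat it as a peek read and let the leader
            // return more operations than strictly required, if it has them.
            return maxBatchOperationCount;
        }
        // Assumed else branch: other reads are in flight, so ask only for the
        // operations in [from, maxRequiredSeqNo] to avoid overlapping ranges.
        return Math.toIntExact(maxRequiredSeqNo - from + 1);
    }

    public static void main(String[] args) {
        System.out.println(requestBatchCount(2002, 2018, 1000, 2)); // 17
        System.out.println(requestBatchCount(2002, 2018, 1000, 0)); // 1000
    }
}
```

For a full-sized range such as from=2 with a known checkpoint of 2018, both branches yield 1000, so only the left-over request is affected.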

dnhatn · 2018-07-31T00:52:59Z

Thanks @bleskes for reviewing.

* elastic/ccr: (57 commits)
  ShardFollowNodeTask should fetch operation once (elastic#32455)
  Do not expose hard-deleted docs in Lucene history (elastic#32333)
  Tests: Fix convert error tests to use fixed value (elastic#32415)
  IndicesClusterStateService should replace an init. replica with an init. primary with the same aId (elastic#32374)
  REST high-level client: parse back _ignored meta field (elastic#32362)
  [CI] Mute DocumentSubsetReaderTests testSearch
  Reject follow request if following setting not enabled on follower (elastic#32448)
  TEST: testDocStats should always use forceMerge (elastic#32450)
  TEST: avoid merge in testSegmentMemoryTrackedInBreaker
  TEST: Avoid deletion in FlushIT
  AwaitsFix IndexShardTests#testDocStats
  Painless: Add method type to method. (elastic#32441)
  Remove reference to non-existent store type (elastic#32418)
  [TEST] Mute failing FlushIT test
  Fix ordering of bootstrap checks in docs (elastic#32417)
  [TEST] Mute failing InternalEngineTests#testSeqNoAndCheckpoints
  Validate source of an index in LuceneChangesSnapshot (elastic#32288)
  [TEST] Mute failing testConvertLongHexError
  bump lucene version after backport
  Upgrade to Lucene-7.5.0-snapshot-608f0277b0 (elastic#32390)
  ...

dnhatn added >bug :Distributed Indexing/CCR Issues around the Cross Cluster State Replication features labels Jul 30, 2018

dnhatn requested review from martijnvg, bleskes and jasontedor July 30, 2018 01:56

elasticmachine mentioned this pull request Jul 30, 2018

Introduce cross-cluster replication #30086

Closed

29 tasks

dnhatn added 2 commits July 29, 2018 22:08

simplify + assertion

32b6f49

Merge branch 'ccr' into ccr-fix-request-twice

77d5c21

optimize for the peak request

b43b473

bleskes approved these changes Jul 30, 2018

dnhatn added 2 commits July 30, 2018 13:01

improve comment

a3313f3

Merge branch 'ccr' into ccr-fix-request-twice

96faca0

dnhatn merged commit 8cfbb64 into elastic:ccr Jul 31, 2018

dnhatn deleted the ccr-fix-request-twice branch July 31, 2018 00:53

dnhatn added the backport pending label Jul 31, 2018

dnhatn mentioned this pull request Jul 31, 2018

ShardFollowNodeTask fetch operations twice #32453

Closed

dnhatn removed the backport pending label Aug 2, 2018
