Linearizable barrier improvements #11939
Conversation
When a linearizable barrier is requested we want the follower to flush its log to make sure that all possible entries are committed with traditional raft semantics. Added handling of flushing the log on the follower if the leader requested it and the append entries request is empty. Signed-off-by: Michal Maslanka <michal@redpanda.com>
When a linearizable barrier is set we want to move the committed offset forward. In this case followers must flush their offsets to allow the leader to commit its entries. Signed-off-by: Michal Maslanka <michal@redpanda.com>
For the STM linearizable barrier to make sense we must wait for the offset to be applied to the STM. Otherwise the linearizable barrier gives no guarantees about the state machine state. Signed-off-by: Michal Maslanka <michal@redpanda.com>
To prevent contention, implemented sharing of the linearizable barrier result between contended callers. Instead of calling the linearizable barrier multiple times, a caller will wait for the result of a barrier that is already being executed. This doesn't change the current semantics of the linearizable barrier, as either way a caller must check the returned offset if they want to wait for the whole history to be applied. Sharing results helps in a situation where multiple parallel fibers try to set up a linearizable barrier. Signed-off-by: Michal Maslanka <michal@redpanda.com>
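A minimal sketch of the result-sharing idea from the last commit, assuming Seastar's shared_future API; the class, the do_barrier() placeholder, and the int64_t stand-in for model::offset are hypothetical illustrations, not the actual redpanda implementation:

#include <seastar/core/future.hh>
#include <seastar/core/shared_future.hh>
#include <optional>

namespace ss = seastar;

class barrier_sharer {
public:
    ss::future<int64_t> linearizable_barrier() {
        if (_in_flight) {
            // a barrier is already executing on this shard; attach to its
            // result instead of dispatching another append entries round
            return _in_flight->get_future();
        }
        // start a new barrier and publish it as a shared_future
        _in_flight.emplace(do_barrier());
        // the first caller clears the slot once the barrier resolves
        return _in_flight->get_future().finally([this] { _in_flight.reset(); });
    }

private:
    // placeholder for the real raft round trip that asks followers to flush
    // and resolves with the confirmed committed offset
    ss::future<int64_t> do_barrier();

    std::optional<ss::shared_future<int64_t>> _in_flight;
};

As the commit message notes, callers still have to check the returned offset: a caller that replicated after the shared barrier started may attach to a result that predates its own write, which is exactly the case discussed in the review below.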
  // wait for the returned offset to be applied
- return wait(r.value(), timeout).then([r] {
+ return wait(r.value(), timeout).then([this, r] {
      return result<model::offset>(r.value());
I think there may be some undesirable side effects to this optimization. On the happy path the shared future resolves when wait() succeeds, so consider the following sequence of actions:
1. linearizable_barrier at offset o
2. usual replicate() at offset o + 1
3. linearizable_barrier at offset o + 2
(3) can potentially resolve with the same shared future as (1); isn't that a problem? Normally it wouldn't be, if we validated that the offset returned from linearizable_barrier in (3) is > o + 1, but we don't do that in many places. I think we have this pattern in topics_frontend, e.g. if we create too many topics back to back, the topics stm may not have applied the changes by the time the barrier returns.
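A hedged sketch of the validation suggested here; barrier_fn, barrier_covers_own_write, and the int64_t offsets are hypothetical names standing in for the real raft consensus API, not existing redpanda code:

#include <seastar/core/future.hh>
#include <seastar/util/noncopyable_function.hh>
#include <optional>

namespace ss = seastar;

// stand-in for linearizable_barrier(): resolves with the confirmed committed
// offset, or nullopt on failure
using barrier_fn = ss::noncopyable_function<ss::future<std::optional<int64_t>>()>;

// Returns true only if the barrier result covers the caller's own write at
// replicated_at. A stale shared barrier result (case (1) reused by (3) above)
// shows up as an offset smaller than replicated_at, telling the caller it
// cannot yet rely on read-your-own-writes.
ss::future<bool>
barrier_covers_own_write(barrier_fn barrier, int64_t replicated_at) {
    return barrier().then([replicated_at](std::optional<int64_t> offset) {
        return offset.has_value() && *offset >= replicated_at;
    });
}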
So the barrier semantics were like that before, i.e. it could always race with replicate(); that is why I decided to do it this way, and why we return an offset that is confirmed.
> so the barrier semantics was like that before

But 3bcf038 tightened that behavior, and this optimization brings the old behavior back, no?

> it could always race with the replicate()

Just to be clear, I'm talking about read-your-own-writes here, so it's not just any replicate(), it's the replicate() issued before the barrier.
You are right, it may happen that a barrier called after the replicate returns the value of the previous barrier. It won't hurt in this case, as that was the previous semantics, but it is definitely confusing.
/backport v23.1.x
Failed to run cherry-pick command. I executed the commands below:
Improved the way the linearizable barrier is handled
Fixes: #11062
Backports Required
Release Notes
Improvements