Dynamic replication batch size #4301

yux0 · 2021-07-07T18:35:19Z

What changed?
Dynamic replication batch size

Why?
The replication batch size should be calculated based on the backlog in a particular shard. This could help with hot shards

How did you test it?
Unit tests
TODO: bench tests

Potential risks

Release notes

Documentation Changes

coveralls · 2021-07-07T18:57:31Z

Pull Request Test Coverage Report for Build 2a76fb8d-1110-483a-acad-b0b1184ceb47

35 of 37 (94.59%) changed or added relevant lines in 2 files are covered.
12 unchanged lines in 4 files lost coverage.
Overall coverage increased (+0.04%) to 56.439%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
service/history/replication/task_ack_manager.go	34	36	94.44%

Files with Coverage Reduction	New Missed Lines	%
service/history/queue/transfer_queue_processor.go	2	57.18%
service/matching/taskListManager.go	2	74.09%
service/history/queue/timer_gate.go	3	95.83%
common/task/fifoTaskScheduler.go	5	84.54%

Totals
Change from base Build 1535c579-3d77-4509-a5ff-d2f42c49ba16:	0.04%
Covered Lines:	79138
Relevant Lines:	140219

💛 - Coveralls

vytautas-karpavicius · 2021-07-08T14:00:10Z

service/history/replication/task_ack_manager.go

@@ -52,6 +53,8 @@ var (
 	errUnknownQueueTask       = errors.New("unknown task type")
 	errUnknownReplicationTask = errors.New("unknown replication task")
 	defaultHistoryPageSize    = 1000
+	minReadTaskSize           = 20
+	maxReplicationLatency     = int64(40)


I suggest adding time unit suffix (I guess Seconds) or even better use time.Duration instead. As of now it is not clear what 40 means.

demirkayaender · 2021-07-08T16:20:04Z

service/history/replication/task_ack_manager.go

+	t.taskLock.Lock()
+	taskLatency := int64(time.Now().Sub(t.lastTaskCreationTime) / time.Second)
+	t.taskLock.Unlock()


Trying to understand how useful this lock is: it feels like having and not having the lock here are the same to me. Could you you explain a bit what it is achieving here?

Are the time assignments in go not atomic that if we try to read its value while assigning somewhere else we get a crash?

Most likely it will not be used. I added it to prevent concurrent calls to the method. Updated it with atomic.Value

common/dynamicconfig/constants.go

service/history/config/config.go

service/history/replication/task_ack_manager.go

…into dynamic-batch-size

common/dynamicconfig/constants.go

…into dynamic-batch-size

yycptt

LGTM. Feel free to land after addressing the comments.

yycptt · 2021-08-27T21:43:50Z

common/dynamicconfig/constants.go

+	// ReplicatorUpperLatency indicates the max allowed replication latency between clusters
+	// KeyName: history.replicatorUpperLatencyInSeconds
+	// Value type: Duration
+	// Default value: 40


nit: 40 * time.Second

yycptt · 2021-08-27T21:47:31Z

service/history/replication/task_ack_manager.go

+		rateLimiter:          rateLimiter,
+		retryPolicy:          retryPolicy,
+		lastTaskCreationTime: atomic.Value{},
+		maxAllowedLatencyFn:  config.ReplicatorUpperLatencyInSeconds,


config.ReplicatorUpperLatencyInSeconds is no longer defined I think.

yycptt · 2021-08-27T21:48:33Z

service/history/replication/task_ack_manager.go

+	if t.lastTaskCreationTime.Load() == nil {
+		return defaultBatchSize
+	}
+	taskLatency := now.Sub(t.lastTaskCreationTime.Load().(time.Time)) / time.Second


We don't need to / time.Second here I think.

Yes. Updated

Dynamic replication batch size

3775f61

yux0 requested review from emrahs, mkolodezny and a team July 7, 2021 18:35

Merge branch 'master' into dynamic-batch-size

0c63981

vytautas-karpavicius reviewed Jul 8, 2021

View reviewed changes

demirkayaender reviewed Jul 8, 2021

View reviewed changes

yux0 added 4 commits July 9, 2021 14:41

Merge branch 'master' into dynamic-batch-size

4eda126

respond to comments

4b68fff

Update ack manager unit test

63bc038

Merge branch 'master' into dynamic-batch-size

a994d71

yycptt reviewed Aug 16, 2021

View reviewed changes

yux0 added 6 commits August 17, 2021 12:01

Merge branch 'master' into dynamic-batch-size

adb245a

address comments

b802470

Merge branch 'dynamic-batch-size' of https://github.com/yux0/cadence …

06ffb4e

…into dynamic-batch-size

Merge branch 'master' into dynamic-batch-size

a32e7ea

Merge branch 'master' into dynamic-batch-size

afddd1e

Merge branch 'master' into dynamic-batch-size

94a6acc

yycptt reviewed Aug 27, 2021

View reviewed changes

common/dynamicconfig/constants.go Outdated Show resolved Hide resolved

common/dynamicconfig/constants.go Outdated Show resolved Hide resolved

yux0 added 2 commits August 27, 2021 14:41

update dynamic config

0e9fa9e

Merge branch 'dynamic-batch-size' of https://github.com/yux0/cadence …

1a3e983

…into dynamic-batch-size

yycptt approved these changes Aug 27, 2021

View reviewed changes

Update dynamic config

2c1c8db

yux0 merged commit cde0f41 into cadence-workflow:master Aug 27, 2021

yux0 deleted the dynamic-batch-size branch August 27, 2021 22:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dynamic replication batch size #4301

Dynamic replication batch size #4301

yux0 commented Jul 7, 2021

coveralls commented Jul 7, 2021 •

edited

Loading

vytautas-karpavicius Jul 8, 2021

demirkayaender Jul 8, 2021

yux0 Jul 14, 2021

yycptt left a comment

yycptt Aug 27, 2021

yycptt Aug 27, 2021

yycptt Aug 27, 2021

yux0 Aug 27, 2021

Dynamic replication batch size #4301

Dynamic replication batch size #4301

Conversation

yux0 commented Jul 7, 2021

coveralls commented Jul 7, 2021 • edited Loading

Pull Request Test Coverage Report for Build 2a76fb8d-1110-483a-acad-b0b1184ceb47

💛 - Coveralls

vytautas-karpavicius Jul 8, 2021

Choose a reason for hiding this comment

demirkayaender Jul 8, 2021

Choose a reason for hiding this comment

yux0 Jul 14, 2021

Choose a reason for hiding this comment

yycptt left a comment

Choose a reason for hiding this comment

yycptt Aug 27, 2021

Choose a reason for hiding this comment

yycptt Aug 27, 2021

Choose a reason for hiding this comment

yycptt Aug 27, 2021

Choose a reason for hiding this comment

yux0 Aug 27, 2021

Choose a reason for hiding this comment

coveralls commented Jul 7, 2021 •

edited

Loading