Batch adding series to query limiter to optimize locking #5505

harry671003 · 2023-08-09T21:58:09Z

What this PR does:

When streaming data from ingesters, a lot of time can be spend on waiting for the QueryLimiter lock to be acquired. This change optimizes this a bit by allowing a batch of series to be added at once.

Benchmarks

I saw some improvements in the benchmarks:

Without batching

(base) ➜  cortex git:(optimize_limiter) go test -bench=^BenchmarkQueryLimiter_AddSeries$ -benchmem -benchtime=200000x github.com/cortexproject/cortex/pkg/util/limiter -count 10          
goos: darwin
goarch: arm64
pkg: github.com/cortexproject/cortex/pkg/util/limiter
BenchmarkQueryLimiter_AddSeries-10        200000             27349 ns/op           31728 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             29049 ns/op           31728 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             30271 ns/op           31728 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             32509 ns/op           31728 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             33181 ns/op           31727 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             32375 ns/op           31727 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             30465 ns/op           31728 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             32754 ns/op           31728 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             33264 ns/op           31728 B/op        599 allocs/op
BenchmarkQueryLimiter_AddSeries-10        200000             31569 ns/op           31728 B/op        599 allocs/op
PASS
ok      github.com/cortexproject/cortex/pkg/util/limiter        63.644s

With batching

(base) ➜  cortex git:(optimize_limiter) go test -bench=^BenchmarkQueryLimiter_AddSeriesBatch$ -benchmem -benchtime=200000x github.com/cortexproject/cortex/pkg/util/limiter -count 10
goos: darwin
goarch: arm64
pkg: github.com/cortexproject/cortex/pkg/util/limiter
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             11871 ns/op           31729 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             10766 ns/op           31728 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             10934 ns/op           31728 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             11575 ns/op           31728 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             11615 ns/op           31729 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             10766 ns/op           31728 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             11049 ns/op           31728 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             10903 ns/op           31728 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             10601 ns/op           31729 B/op        401 allocs/op
BenchmarkQueryLimiter_AddSeriesBatch-10           200000             11080 ns/op           31728 B/op        401 allocs/op
PASS
ok      github.com/cortexproject/cortex/pkg/util/limiter        23.073s

Which issue(s) this PR fixes:
Fixes #

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

yeya24 · 2023-08-09T22:22:51Z

pkg/util/limiter/query_limiter.go

+	for _, s := range series {
+		fingerprint := client.FastFingerprint(s)
+		ql.uniqueSeries[fingerprint] = struct{}{}
+	}


I am okay with this but I am also wondering if we can calculate the hashes first before we lock.
This requires to allocate a slice of int64 so not sure if it is worth. Probably we need some benchmark here

I did see some improvement when hashing the series outside the lock. I'll implement your suggestion.

@harry671003 Can you run the benchmark again? And let's also add -benchmem to show memory allocation?

# old.out - Without the slice of fps # new.out - With the slice of fps > benchstat old.out new.out goos: darwin goarch: arm64 pkg: github.com/cortexproject/cortex/pkg/util/limiter │ old.out │ new.out │ │ sec/op │ sec/op vs base │ QueryLimiter_AddSeriesBatch-10 12.97µ ± 7% 11.39µ ± 3% -12.20% (p=0.000 n=10) │ old.out │ new.out │ │ B/op │ B/op vs base │ QueryLimiter_AddSeriesBatch-10 30.20Ki ± 0% 30.98Ki ± 0% +2.59% (p=0.000 n=10) │ old.out │ new.out │ │ allocs/op │ allocs/op vs base │ QueryLimiter_AddSeriesBatch-10 400.0 ± 0% 401.0 ± 0% +0.25% (p=0.000 n=10)

Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>

harry671003 · 2023-08-23T19:37:29Z

@alanprot @friedrichg Could you take a look please?

alanprot · 2023-08-23T20:15:45Z

pkg/util/limiter/query_limiter.go

-// AddSeries adds the input series and returns an error if the limit is reached.
-func (ql *QueryLimiter) AddSeries(seriesLabels []cortexpb.LabelAdapter) error {
+// AddSeriesBatch adds the batch of input series and returns an error if the limit is reached.
+func (ql *QueryLimiter) AddSeries(series ...[]cortexpb.LabelAdapter) error {


Just a small nit:

Can we receive an [][]cortexpb.LabelAdapter instaed of ...[]cortexpb.LabelAdapter to avoid copying this slice over?

From go doc:

If f is variadic with a final parameter p of type ...T, then within f the type of p is equivalent to type []T. If f is invoked with no actual arguments for p, the value passed to p is nil. Otherwise, the value passed is a new slice of type []T with a new underlying array whose successive elements are the actual arguments, which all must be assignable to T. The length and capacity of the slice is therefore the number of arguments bound to p and may differ for each call site.

Correct me if I'm wrong. The doc is saying that if we pass a slice into a variadic function, it'll not create a new array right?

If the final argument is assignable to a slice type []T and is followed by ..., it is passed unchanged as the value for a ...T parameter. In this case no new slice is created.

series := [][]cortexpb.LabelAdapter{s1, s2} AddSeries(series...) // This will not create a new underlying array.

alanprot · 2023-08-23T20:16:05Z

This looks great! Just a small nit!

LGTM.

pull-request-size bot added the size/L label Aug 9, 2023

harry671003 changed the title ~~Optimize query limiter by allowing to add batch of series~~ Batch adding series to query limiter to optimize locking Aug 9, 2023

yeya24 reviewed Aug 9, 2023

View reviewed changes

harry671003 force-pushed the optimize_limiter branch 2 times, most recently from 3d9eaef to cfd6140 Compare August 10, 2023 21:32

Batch adding series to query limiter to optimize locks

e31817d

Signed-off-by: 🌲 Harry 🌊 John 🏔 <johrry@amazon.com>

harry671003 force-pushed the optimize_limiter branch from cfd6140 to e31817d Compare August 11, 2023 17:23

yeya24 approved these changes Aug 11, 2023

View reviewed changes

alanprot reviewed Aug 23, 2023

View reviewed changes

Merge remote-tracking branch 'origin/master' into optimize_limiter

3bfbcc8

yeya24 merged commit f560115 into cortexproject:master Sep 14, 2023
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch adding series to query limiter to optimize locking #5505

Batch adding series to query limiter to optimize locking #5505

harry671003 commented Aug 9, 2023 •

edited

Loading

yeya24 Aug 9, 2023

harry671003 Aug 10, 2023

yeya24 Aug 10, 2023

harry671003 Aug 11, 2023 •

edited

Loading

harry671003 commented Aug 23, 2023

alanprot Aug 23, 2023

harry671003 Sep 13, 2023

alanprot commented Aug 23, 2023

Batch adding series to query limiter to optimize locking #5505

Batch adding series to query limiter to optimize locking #5505

Conversation

harry671003 commented Aug 9, 2023 • edited Loading

What this PR does:

Benchmarks

Without batching

With batching

yeya24 Aug 9, 2023

Choose a reason for hiding this comment

harry671003 Aug 10, 2023

Choose a reason for hiding this comment

yeya24 Aug 10, 2023

Choose a reason for hiding this comment

harry671003 Aug 11, 2023 • edited Loading

Choose a reason for hiding this comment

harry671003 commented Aug 23, 2023

alanprot Aug 23, 2023

Choose a reason for hiding this comment

harry671003 Sep 13, 2023

Choose a reason for hiding this comment

alanprot commented Aug 23, 2023

harry671003 commented Aug 9, 2023 •

edited

Loading

harry671003 Aug 11, 2023 •

edited

Loading