Cache label IDs #1414

niksajakovljevic · 2022-06-06T10:19:04Z

Add an inverted cache, (metric + label pair) -> (id, pos), to avoid DB calls for
fetching label IDs when the series ID is not cached.
This cache is used only during metric ingestion.
Benchmarks show roughly 5-10% gains in ingest performance and
about 25% fewer DB calls for fetching label IDs (note that these numbers
depend heavily on the shape of the dataset).
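To make the mapping concrete, here is a minimal sketch of an inverted cache keyed by (metric + label pair) and returning (id, pos). It is not the code from this PR: the type and function names (`labelKey`, `labelInfo`, `invertedLabelsCache`, `splitByCache`) are hypothetical, and a plain map stands in for whatever bounded, concurrency-safe cache the real implementation uses.

```go
package main

import "fmt"

// labelKey identifies a label pair within a metric.
// All names in this sketch are hypothetical.
type labelKey struct {
	Metric string
	Name   string
	Value  string
}

// labelInfo is the cached value: the label ID and its position.
type labelInfo struct {
	ID  int32
	Pos int32
}

// invertedLabelsCache maps (metric + label pair) -> (id, pos).
// Assumption: a plain map replaces a bounded, thread-safe cache.
type invertedLabelsCache struct {
	entries map[labelKey]labelInfo
}

func newInvertedLabelsCache() *invertedLabelsCache {
	return &invertedLabelsCache{entries: make(map[labelKey]labelInfo)}
}

func (c *invertedLabelsCache) Get(k labelKey) (labelInfo, bool) {
	v, ok := c.entries[k]
	return v, ok
}

func (c *invertedLabelsCache) Put(k labelKey, v labelInfo) {
	c.entries[k] = v
}

// splitByCache separates the keys answered by the cache from the keys
// that would still require a database lookup.
func splitByCache(c *invertedLabelsCache, keys []labelKey) (map[labelKey]labelInfo, []labelKey) {
	hits := make(map[labelKey]labelInfo)
	var misses []labelKey
	for _, k := range keys {
		if v, ok := c.Get(k); ok {
			hits[k] = v
		} else {
			misses = append(misses, k)
		}
	}
	return hits, misses
}

func main() {
	c := newInvertedLabelsCache()
	c.Put(labelKey{Metric: "cpu_usage", Name: "host", Value: "a"}, labelInfo{ID: 7, Pos: 1})

	hits, misses := splitByCache(c, []labelKey{
		{Metric: "cpu_usage", Name: "host", Value: "a"},
		{Metric: "cpu_usage", Name: "region", Value: "eu"},
	})
	// Only the misses would need a round trip to the database.
	fmt.Println("hits:", len(hits), "misses:", len(misses))
}
```

Only the keys returned as misses would be sent to the database, which is where the reported reduction in DB calls would come from.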

niksajakovljevic · 2022-06-06T14:06:21Z

Closes #1392

cevian · 2022-06-06T14:21:39Z

@niksajakovljevic have you considered using (label pair)=> (id, map[metric_name]=>pos) instead? That would allow reusing the existing cache and thus improve overall cache hit ratio?
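For illustration, the layout suggested above could look roughly like the following sketch: one entry per label pair holding the ID plus a per-metric position map. The names (`labelPair`, `sharedLabelEntry`, `sharedLabelCache`) are hypothetical, and the sketch assumes, as the suggestion implies, that a label pair has a single ID shared across metrics while its position varies by metric.

```go
package main

import "fmt"

// labelPair is a label name/value pair, independent of any metric.
// All names in this sketch are hypothetical.
type labelPair struct {
	Name  string
	Value string
}

// sharedLabelEntry holds one ID per label pair and a position per metric.
type sharedLabelEntry struct {
	ID          int32
	PosByMetric map[string]int32
}

// sharedLabelCache maps (label pair) -> (id, map[metric_name] -> pos).
type sharedLabelCache map[labelPair]*sharedLabelEntry

// record stores the ID for a label pair and its position under a metric.
func (c sharedLabelCache) record(metric string, lp labelPair, id, pos int32) {
	e, ok := c[lp]
	if !ok {
		e = &sharedLabelEntry{ID: id, PosByMetric: make(map[string]int32)}
		c[lp] = e
	}
	e.PosByMetric[metric] = pos
}

// lookup returns the ID and position of a label pair under a metric.
// ok is false when either the pair or the metric position is unknown.
func (c sharedLabelCache) lookup(metric string, lp labelPair) (id, pos int32, ok bool) {
	e, found := c[lp]
	if !found {
		return 0, 0, false
	}
	pos, ok = e.PosByMetric[metric]
	return e.ID, pos, ok
}

func main() {
	c := sharedLabelCache{}
	host := labelPair{Name: "host", Value: "a"}
	c.record("cpu_usage", host, 7, 1)
	c.record("mem_usage", host, 7, 3)

	id, pos, ok := c.lookup("mem_usage", host)
	fmt.Println(id, pos, ok)
}
```

Compared with a key that includes the metric, this shape stores each label pair once, at the cost of a second map lookup per query.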

Harkishen-Singh

Why are we not using clockcache?

pkg/pgclient/config.go

pkg/pgmodel/cache/inverted_labels_cache.go

pkg/pgmodel/ingestor/series_writer.go

niksajakovljevic · 2022-06-15T12:24:50Z

> @niksajakovljevic have you considered using (label pair)=> (id, map[metric_name]=>pos) instead? That would allow reusing the existing cache and thus improve overall cache hit ratio?

The labels reader cache is actually the inverse mapping (id -> label pair), so we can't reuse it.

antekresic · 2022-06-17T10:59:48Z

pkg/pgmodel/ingestor/series_writer.go

+		for _, cachedLabel := range info.cachedLabels {
+			if val, ok := labelMap[cachedLabel]; ok {
+				if int(val.Pos) > info.maxPos {
+					info.maxPos = int(val.Pos)


Can we do this when fetching from cache?

niksajakovljevic · 2022-06-17

Yes, we have to do it: some labels are cached, and maxPos must be correct, meaning it holds the maximum over both fetched and cached labels.

antekresic · 2022-06-20

No, I meant: can you start calculating the max position while fetching the cached entries, so you don't have to iterate through the cached labels again?
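The single-pass idea in the last comment can be sketched as follows. This is an illustration only, not the code that was merged: `labelKey`, `labelInfo`, `fetchCached`, and `mergeFetched` are hypothetical names, and a plain map again stands in for the real cache. The running maximum is updated during the cache lookup, and positions fetched from the database are folded in afterwards.

```go
package main

import "fmt"

// Hypothetical types for illustration.
type labelKey struct {
	Metric string
	Name   string
	Value  string
}

type labelInfo struct {
	ID  int32
	Pos int32
}

// fetchCached looks up every key and tracks the largest position among
// the cache hits in the same pass, so no second iteration is needed.
func fetchCached(cache map[labelKey]labelInfo, keys []labelKey) (map[labelKey]labelInfo, []labelKey, int) {
	hits := make(map[labelKey]labelInfo, len(keys))
	var misses []labelKey
	maxPos := 0
	for _, k := range keys {
		v, ok := cache[k]
		if !ok {
			misses = append(misses, k)
			continue
		}
		hits[k] = v
		if int(v.Pos) > maxPos {
			maxPos = int(v.Pos)
		}
	}
	return hits, misses, maxPos
}

// mergeFetched folds the positions of labels fetched from the database
// into the running maximum computed over the cached labels.
func mergeFetched(maxPos int, fetched []labelInfo) int {
	for _, v := range fetched {
		if int(v.Pos) > maxPos {
			maxPos = int(v.Pos)
		}
	}
	return maxPos
}

func main() {
	cache := map[labelKey]labelInfo{
		{Metric: "m", Name: "a", Value: "1"}: {ID: 1, Pos: 2},
		{Metric: "m", Name: "b", Value: "2"}: {ID: 2, Pos: 5},
	}
	keys := []labelKey{
		{Metric: "m", Name: "a", Value: "1"},
		{Metric: "m", Name: "b", Value: "2"},
		{Metric: "m", Name: "c", Value: "3"}, // not cached
	}
	_, misses, maxPos := fetchCached(cache, keys)

	// Pretend the miss was resolved by the database at position 7.
	maxPos = mergeFetched(maxPos, []labelInfo{{ID: 3, Pos: 7}})
	fmt.Println(len(misses), maxPos)
}
```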

pkg/pgmodel/ingestor/series_writer.go

pkg/pgmodel/cache/flags.go

pkg/pgmodel/cache/inverted_labels_cache.go

niksajakovljevic added the Performance label (Improvements that are specifically related to performance) Jun 6, 2022

niksajakovljevic requested a review from antekresic as a code owner June 6, 2022 10:19

niksajakovljevic self-assigned this Jun 6, 2022

niksajakovljevic requested review from paulfantom and a team as code owners June 6, 2022 10:19

niksajakovljevic requested a review from Harkishen-Singh June 6, 2022 10:19

niksajakovljevic force-pushed the niksa/cache-label-ids branch 2 times, most recently from ce7ae00 to 23147d4, June 6, 2022 14:04

niksajakovljevic force-pushed the niksa/cache-label-ids branch from 23147d4 to 7377014, June 6, 2022 14:50

Harkishen-Singh suggested changes Jun 7, 2022

niksajakovljevic force-pushed the niksa/cache-label-ids branch 7 times, most recently from ab8d180 to 91e629c, June 16, 2022 12:30

niksajakovljevic requested a review from Harkishen-Singh June 16, 2022 12:42

niksajakovljevic added this to the Improve Installation and Ingest Performance I milestone Jun 17, 2022

antekresic reviewed Jun 17, 2022

niksajakovljevic requested a review from antekresic June 17, 2022 11:19

niksajakovljevic force-pushed the niksa/cache-label-ids branch from 91e629c to 49ca9a7, June 20, 2022 09:20

antekresic approved these changes Jun 20, 2022

Harkishen-Singh suggested changes Jun 20, 2022

niksajakovljevic force-pushed the niksa/cache-label-ids branch from 49ca9a7 to fafbd69, June 20, 2022 10:12

niksajakovljevic requested a review from Harkishen-Singh June 20, 2022 10:13

Harkishen-Singh approved these changes Jun 20, 2022

niksajakovljevic force-pushed the niksa/cache-label-ids branch from fafbd69 to 2396c0e, June 20, 2022 11:11

niksajakovljevic merged commit 8ba45d2 into master Jun 20, 2022

niksajakovljevic deleted the niksa/cache-label-ids branch June 20, 2022 11:47

niksajakovljevic mentioned this pull request Jul 11, 2022

Add caching for label IDs #1392

Closed

peppercoffee added the IIP-1 label (Improve Ingestion Performance (part 1)) Jul 19, 2022

peppercoffee modified the milestones: Improve Installation and Ingest Performance I, 0.13.0 Jul 19, 2022
