
Add support for caching based on compaction level and block age #805

Merged — 15 commits into grafana:main, Aug 6, 2021

Conversation

@annanay25 (Contributor) commented Jul 7, 2021

What this PR does:
Adds support for caching based on compaction level and max block age. Ideally, we want to cache the bloom filters of blocks that will be around the longest — that is, blocks that quickly reach higher compaction levels.

Which issue(s) this PR fixes:
Fixes #na!

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@annanay25 annanay25 marked this pull request as ready for review July 16, 2021 15:09
@annanay25 (Contributor, Author)

There's one last thing left to implement: remove any caching while writing objects at the ingester. At the compactor we can have a similar shouldCache function to the one implemented for the querier.

@joe-elliott (Member)

Don't we need to merge #818 first before this?

remove any caching while writing objects at the ingester

Why wouldn't we cache objects written by the ingester?

@annanay25 (Contributor, Author)

Don't we need to merge #818 first before this?

That's right. 818 is cleaned up and ready to review.

Why wouldn't we cache objects written by the ingester?

My bad, I didn't mean remove caching entirely: the write methods on the ingester/compactor will use ShouldCache() to decide whether the objects should be cached.

docs/tempo/website/operations/caching.md (resolved)
tempodb/tempodb.go (resolved)
Comment on lines +104 to +105
uncachedReader backend.Reader
uncachedWriter backend.Writer
Member

It would be nice if we could shield off direct access to r, w, uncachedReader and uncachedWriter, so that callers are always forced to go through getReaderForBlock and getWriterForBlock. Unfortunately we can't hide private fields because everything is in the same package.

I think we should either add a warning comment or combine these fields into a new struct that knows what to cache.

When I search for usages of r and w, I still find some code using them directly (i.e. without getXxxForBlock). I don't know if this is okay.

r:

  • compactor.go:154 (compact)
  • compactor.go:160 (compact)
  • tempodb.go:208 (CompleteBlock)

w:

  • compactor.go:251 (appendBlock)
  • tempodb.go:208 (CompleteBlock)
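One way to realize the "combine these fields into a new struct that knows what to cache" suggestion could look like the following sketch (all names — `cacheAwareBackend`, `getReaderForBlock`, and the minimal `Reader`/`Writer` interfaces — are hypothetical, not Tempo's code; and as noted above, Go cannot enforce the encapsulation within the same package, so this only makes call sites more disciplined):

```go
package main

import "fmt"

// Reader and Writer are minimal stand-ins for Tempo's backend interfaces.
type Reader interface{ Name() string }
type Writer interface{ Name() string }

type namedReader struct{ name string }

func (r namedReader) Name() string { return r.name }

type namedWriter struct{ name string }

func (w namedWriter) Name() string { return w.name }

// cacheAwareBackend groups the cached and uncached pairs behind accessors,
// so call sites naturally go through the ShouldCache decision instead of
// touching r/w directly.
type cacheAwareBackend struct {
	cachedReader   Reader
	cachedWriter   Writer
	uncachedReader Reader
	uncachedWriter Writer
	shouldCache    func(compactionLevel uint8) bool
}

func (b *cacheAwareBackend) getReaderForBlock(level uint8) Reader {
	if b.shouldCache(level) {
		return b.cachedReader
	}
	return b.uncachedReader
}

func (b *cacheAwareBackend) getWriterForBlock(level uint8) Writer {
	if b.shouldCache(level) {
		return b.cachedWriter
	}
	return b.uncachedWriter
}

func main() {
	b := &cacheAwareBackend{
		cachedReader:   namedReader{"cached"},
		cachedWriter:   namedWriter{"cached"},
		uncachedReader: namedReader{"uncached"},
		uncachedWriter: namedWriter{"uncached"},
		shouldCache:    func(level uint8) bool { return level >= 2 },
	}
	fmt.Println(b.getReaderForBlock(3).Name()) // cached
	fmt.Println(b.getWriterForBlock(1).Name()) // uncached
}
```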

@mdisibio (Contributor) commented Aug 6, 2021
Some cases are unavoidable, such as compactor.go:154 loading the block meta from the backend, so the fields will have to remain accessible. compactor.go:160 and :251 are ok because they are iterating or flushing data, which does not involve the bloom filter. tempodb.go:208 actually looks like obsolete code only used by a querier test; the ingester used to call this when iterating the wal, but now it calls CompleteBlockWithBackend.

Member

Why is compactor.go:154 unavoidable? Using r directly means we read through the cache; what's the difference from using the reader we get from getReaderForBlock?

Contributor

You're right. I was thinking there would be cases where the meta is not yet available — i.e. a direct read is needed to fetch it — but in this case the compactor already has it.


@kvrhdn (Member) left a comment

Overall the changes look good and are easy to follow. It's fine by me to merge now and address the comments later.

@mdisibio mdisibio merged commit db1209f into grafana:main Aug 6, 2021
@annanay25 annanay25 deleted the caching-strategy branch August 9, 2021 06:56