lots of time wasted on count_deltas() #6861

Closed · 4 of 5 tasks
problame opened this issue Feb 21, 2024 · 5 comments
Labels: a/performance Area: relates to performance of the system · c/storage/pageserver Component: storage: pageserver

Comments

problame (Contributor) commented Feb 21, 2024

Problem

original thread: https://neondb.slack.com/archives/C033RQ5SPDH/p1708513450565049

Now that the flamegraphs are fixed, I took one on ps-2 ap-southeast-1 to investigate the elevated CPU usage after enabling tokio-epoll-uring there.
That investigation isn't the subject of this issue, though; what matters here is the general finding of where that pageserver is spending its time.
LayerMap::count_deltas inside time_for_new_image_layer completely dominates the CPU usage there.
AFAICT it is called for every tenant, even if the layer map hasn't changed.

This is wasteful.

[flamegraph: ps-2 ap-southeast-1, tokio-epoll-uring, 60s capture]

Solution

If the layer map and partitioning are the same as in an earlier call, early-exit in time_for_new_image_layer to avoid the call to count_deltas().
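
A minimal sketch of that early-exit idea, assuming the layer map exposes some monotonically increasing version number that is bumped on every mutation; the names (`ImageLayerCheckCache`, `can_skip`) are illustrative, not the actual pageserver API:

```rust
// Hypothetical sketch (not the actual pageserver code): remember the inputs of the
// previous check and skip the expensive count_deltas() walk when nothing changed.
#[derive(Default)]
struct ImageLayerCheckCache {
    // (layer-map version, partitioning LSN) observed at the previous check.
    last_inputs: Option<(u64, u64)>,
}

impl ImageLayerCheckCache {
    /// Returns true if the count_deltas() walk can be skipped because neither the
    /// layer map nor the partitioning changed since the last call.
    fn can_skip(&mut self, layer_map_version: u64, partition_lsn: u64) -> bool {
        let inputs = (layer_map_version, partition_lsn);
        if self.last_inputs == Some(inputs) {
            return true;
        }
        self.last_inputs = Some(inputs);
        false
    }
}

fn main() {
    let mut cache = ImageLayerCheckCache::default();
    assert!(!cache.can_skip(1, 100)); // first call: must run count_deltas()
    assert!(cache.can_skip(1, 100));  // nothing changed: early-exit
    assert!(!cache.can_skip(2, 100)); // layer map changed: recompute
}
```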

problame added the c/storage/pageserver and a/performance labels on Feb 21, 2024
problame added a commit that referenced this issue Feb 21, 2024
…ove backwards

This PR enforces aspects of `Timeline::repartition` that were already
true at runtime:

- it's not called concurrently, so bail out if it is anyway (see the
  comment for why it's not called concurrently)
- the `lsn` should never move backwards over the lifetime of a
  Timeline object, because last_record_lsn() can only move forwards
  over the lifetime of a Timeline object

part of #6861
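
A rough sketch of how those two invariants can be enforced, assuming a simplified `Timeline` with a plain `u64` LSN; the `try_lock`-based bail-out and the monotonicity check are the point here, the names and signatures are not the real ones:

```rust
use std::sync::Mutex;

struct RepartitionState {
    // LSN of the last successful repartition; must only move forwards.
    last_lsn: u64,
}

struct Timeline {
    repartition_state: Mutex<RepartitionState>,
}

#[derive(Debug)]
enum RepartitionError {
    ConcurrentCall,
    LsnMovedBackwards { previous: u64, requested: u64 },
}

impl Timeline {
    fn repartition(&self, lsn: u64) -> Result<(), RepartitionError> {
        // Invariant 1: repartition is never called concurrently; if it is
        // anyway, bail out instead of blocking.
        let mut state = self
            .repartition_state
            .try_lock()
            .map_err(|_| RepartitionError::ConcurrentCall)?;

        // Invariant 2: the requested lsn never moves backwards, because
        // last_record_lsn() is monotonic over the Timeline's lifetime.
        if lsn < state.last_lsn {
            return Err(RepartitionError::LsnMovedBackwards {
                previous: state.last_lsn,
                requested: lsn,
            });
        }
        state.last_lsn = lsn;

        // ... actual repartitioning work would go here ...
        Ok(())
    }
}

fn main() {
    let tl = Timeline {
        repartition_state: Mutex::new(RepartitionState { last_lsn: 0 }),
    };
    assert!(tl.repartition(100).is_ok());
    assert!(matches!(
        tl.repartition(50),
        Err(RepartitionError::LsnMovedBackwards { .. })
    ));
}
```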
problame added a commit that referenced this issue Feb 21, 2024
It's been dead-code-at-runtime for 9 months, let's remove it.
We can always re-introduce it at a later point.

Came across this while working on #6861, which will touch
`time_for_new_image_layer`. This is an opportunity to make that function
simpler.
hlinnaka (Contributor) commented:

The new compaction code in #6830 no longer calls count_deltas. (It needs testing to see whether it introduces other problems, of course.)

problame (Contributor, Author) commented:

Yeah, I'm aware. @arpad-m is going to work on compaction, but it'll be many more weeks until it lands, I think.

problame added a commit that referenced this issue Feb 26, 2024
It's been dead-code-at-runtime for 9 months, let's remove it.
We can always re-introduce it at a later point.

Came across this while working on #6861, which will touch
`time_for_new_image_layer`. This is an opportunity to make that function
simpler.
problame added a commit that referenced this issue Feb 26, 2024
…e backwards (#6862)

This PR enforces aspects of `Timeline::repartition` that were already
true at runtime:

- it's not called concurrently, so bail out if it is anyway (see the
  comment for why it's not called concurrently)
- the `lsn` should never move backwards over the lifetime of a
  Timeline object, because last_record_lsn() can only move forwards
  over the lifetime of a Timeline object

The switch to tokio::sync::Mutex blows up the size of the `partitioning`
field from 40 bytes to 72 bytes on Linux x86_64.
That would be concerning if it was a hot field, but, `partitioning` is
only accessed every 20s by one task, so, there won't be excessive cache
pain on it.
(It still sucks that it's now >1 cache line, but I need the Send-able
MutexGuard in the next PR)

part of #6861
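
To illustrate the Send-able guard point: tokio::sync::MutexGuard is Send when the inner type is, so it can be held across an .await inside a spawned task, which std::sync::MutexGuard does not allow. A hedged, simplified illustration follows; the `Partitioning` type and field names are stand-ins, not the real Timeline definition, and it assumes a tokio dependency with the sync, time, rt, and macros features:

```rust
use std::sync::Arc;
use tokio::sync::Mutex;

#[derive(Default)]
struct Partitioning {
    // stand-in for the real partitioning data
    keyspace_partitions: Vec<u64>,
}

#[tokio::main]
async fn main() {
    let partitioning = Arc::new(Mutex::new(Partitioning::default()));

    let p = Arc::clone(&partitioning);
    // tokio::spawn requires the future to be Send; a tokio MutexGuard is Send,
    // so it can be held across the .await below. With std::sync::Mutex this
    // would not compile.
    let handle = tokio::spawn(async move {
        let mut guard = p.lock().await;
        // Simulate async work (e.g. reading layers) while the lock is held.
        tokio::time::sleep(std::time::Duration::from_millis(10)).await;
        guard.keyspace_partitions.push(42);
    });

    handle.await.unwrap();
    assert_eq!(partitioning.lock().await.keyspace_partitions, vec![42]);
}
```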
VladLazar self-assigned this on Mar 25, 2024
problame (Contributor, Author) commented:

@VladLazar just in case you didn't see it, my PR to avoid count_deltas() is here: #6868

Feel free to take it over

VladLazar (Contributor) commented:

Update:

  • WAL-based solution merged; it will be released this week (2024-04-02). (See the sketch after this list for the general idea.)
  • Need to check that things have improved post-release. Keeping the ticket open until then.
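
A hypothetical sketch of what a "WAL-based" trigger can look like; the actual change is in #7230 and its exact fields and thresholds are not reproduced here. The idea is to track how much WAL has been ingested since the last image layer and make the check O(1), instead of walking the layer map with count_deltas():

```rust
struct WalBasedImageTrigger {
    /// WAL bytes ingested since the last image layer was created (assumed name).
    wal_bytes_since_last_image: u64,
    /// Threshold after which a new image layer is considered worthwhile.
    threshold_bytes: u64,
}

impl WalBasedImageTrigger {
    fn record_ingest(&mut self, bytes: u64) {
        self.wal_bytes_since_last_image += bytes;
    }

    /// O(1) check, independent of layer-map size.
    fn time_for_new_image_layer(&self) -> bool {
        self.wal_bytes_since_last_image >= self.threshold_bytes
    }

    fn on_image_layer_created(&mut self) {
        self.wal_bytes_since_last_image = 0;
    }
}

fn main() {
    let mut trigger = WalBasedImageTrigger {
        wal_bytes_since_last_image: 0,
        threshold_bytes: 512 * 1024 * 1024, // illustrative 512 MiB threshold
    };
    trigger.record_ingest(600 * 1024 * 1024);
    assert!(trigger.time_for_new_image_layer());
    trigger.on_image_layer_created();
    assert!(!trigger.time_for_new_image_layer());
}
```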

VladLazar (Contributor) commented Apr 8, 2024

Looks like #7230 helped here. Generated another flamegraph this morning and it's not exhibiting the original issue:

(ask me if you want the svg - can't add it here for some reason)

[flamegraph: 2024-04-08, ps-2 ap-southeast-1, perf check for count_deltas]
