
Prewarm compute nodes #4828

Merged: 3 commits into main on Jul 31, 2023

Conversation

@bojanserafimov (Contributor) commented on Jul 27, 2023

Problem

After deploying the sync safekeepers hot path, total startup improved, but postgres startup p90 in us-east-2 got worse. I'm not sure, but it seems to affect only VMs: that's the only region that has VMs, they take about 10% of starts, and only p90 starts are impacted. Grepping logs also confirms it's mostly VMs.

Now the question is: why does postgres start faster on VMs when we run walproposer.c before it? It's possible that walproposer warms up the VM (a sketch of both mechanisms follows this list):

  • qemu allocates RAM lazily, so walproposer breaks the ice for postgres
  • our binaries are large, and walproposer pulls them into the OS page cache
  • ???
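
To make those two hypotheses concrete, here's a minimal sketch (not from this PR; the 256 MiB size and the binary path are assumptions) of what the warm-up effect amounts to: writing to every page of a fresh allocation forces qemu to actually back it with host RAM, and reading a binary end-to-end pulls it into the OS page cache so a later exec doesn't have to fault it in from disk.

```rust
use std::fs::File;
use std::io::{self, Read};

/// Touch every page of a fresh allocation so the hypervisor backs it with
/// real host RAM (qemu hands guest memory out lazily, on first write).
fn fault_in_pages(bytes: usize) {
    const PAGE: usize = 4096;
    let mut buf = vec![0u8; bytes];
    for i in (0..buf.len()).step_by(PAGE) {
        buf[i] = 1; // one write per page is enough to trigger allocation
    }
    std::hint::black_box(&buf); // keep the writes from being optimized away
}

/// Read a binary end-to-end so a later exec() hits the OS page cache.
fn warm_file_cache(path: &str) -> io::Result<u64> {
    let mut file = File::open(path)?;
    let mut chunk = [0u8; 1 << 16];
    let mut total = 0u64;
    loop {
        let n = file.read(&mut chunk)?;
        if n == 0 {
            break;
        }
        total += n as u64;
    }
    Ok(total)
}

fn main() -> io::Result<()> {
    fault_in_pages(256 * 1024 * 1024); // assumption: enough to matter for p90
    let n = warm_file_cache("/usr/local/bin/postgres")?; // hypothetical path
    println!("pulled {n} bytes of the postgres binary into the page cache");
    Ok(())
}
```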

Summary of changes

When we start compute_ctl in pool mode, I run initdb and postgres once to warm the node up. I'm not sure this will have an effect, but it's easy to test on staging, and it's easy to revert if it doesn't work.
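
Roughly, the prewarm step amounts to something like the sketch below. This is an illustration of the idea, not the PR's actual code; the scratch data directory, the port, and the two-second sleep are all assumptions.

```rust
use std::process::Command;
use std::{fs, thread, time::Duration};

/// Run initdb and a short-lived postgres against a throwaway data directory,
/// purely to fault binaries, shared libraries, and memory into the VM.
fn prewarm_postgres() -> std::io::Result<()> {
    let datadir = "/tmp/prewarm-pgdata"; // scratch dir, discarded afterwards

    // initdb populates the scratch cluster (and drags initdb itself,
    // postgres, and their shared libraries through the OS page cache).
    let status = Command::new("initdb").args(["-D", datadir]).status()?;
    assert!(status.success(), "initdb failed");

    // Start postgres on the scratch cluster, give it a moment to finish
    // startup, then kill it. SIGKILL is fine: the cluster is disposable.
    let mut pg = Command::new("postgres")
        .args(["-D", datadir, "-p", "54329"]) // unused port, assumption
        .spawn()?;
    thread::sleep(Duration::from_secs(2));
    pg.kill()?;
    pg.wait()?;

    fs::remove_dir_all(datadir)?;
    Ok(())
}

fn main() -> std::io::Result<()> {
    prewarm_postgres()
}
```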

@github-actions bot commented on Jul 27, 2023

1240 tests run: 1190 passed, 0 failed, 50 skipped (full report)


@bojanserafimov (Contributor, Author):

Another mystery: why does postgres start time depend so much on the requester? cc @ololobus in case it's obvious to you. Maybe it's a correlation, but idk what it would correlate with.

@ololobus (Member) left a comment:

Wow, that's a peculiar theory :) I'd love to see more proof / investigation, but it should also be fine to just try it.

Review threads: compute_tools/src/bin/compute_ctl.rs (resolved), compute_tools/src/compute.rs (outdated, resolved)
@ololobus (Member):

> Another mystery: why does postgres start time depend so much on the [requester]? cc @ololobus in case it's obvious to you. Maybe it's a correlation, but idk what it would correlate with.

Several correlations you should be aware of:

  • endpoint_api — an explicit API call to start an endpoint, or to create a new endpoint on an existing branch without one (or to add a read-only replica). I'm pretty sure 99% of these calls are ours and come from e2e tests.
  • create_branch — a start on some new timeline. It could be that on start Postgres does some get-page requests, and maybe Pageserver performs better when it's a fresh timeline? (I was expecting the opposite, though.)
  • proxy — since this is staging, these are the only starts that could come from wake-ups for the periodic perf tests we run there. Alexander, Lassi, and Artur all have their own fleets of computes, so an older compute may start slower? (Again, only if Postgres does enough get-page requests.)

Just random thoughts; I can't give any good explanation.

@bojanserafimov (Contributor, Author) commented on Jul 28, 2023

> Wow, that's a peculiar theory :) I'd love to see more proof / investigation, but it should also be fine to just try it.

One more data point: I found out that on staging, walproposer full sync actually runs on endpoint_api and create_branch requests. And that's exactly the group of postgres starts that didn't regress :)

Off topic: for this reason I also switched the "inside pod breakdown" panel to show only proxy requests. IMO it makes sense to de-prioritize create_branch, as it's in the same category as create_project.

@bojanserafimov bojanserafimov marked this pull request as ready for review July 28, 2023 14:33
@bojanserafimov bojanserafimov requested a review from a team as a code owner July 28, 2023 14:33
@ololobus (Member) left a comment:

I don't see any immediate issues with it, if you want to run experiments on staging :)

Review thread: compute_tools/src/compute.rs (resolved)
@bojanserafimov bojanserafimov merged commit ddbe170 into main Jul 31, 2023
@bojanserafimov bojanserafimov deleted the prewarm-compute branch July 31, 2023 18:13
@bojanserafimov (Contributor, Author):

Now that we know this works, let's reopen the question of avoiding binding to VMs that are busy prewarming. Do you think I should add a new ComputeStatus variant for the "prewarming" case, or maybe avoid taking HTTP requests until we're done prewarming? Or just let cplane figure it out (prefer older VMs to newer ones)?

@ololobus (Member) commented on Aug 1, 2023

> Do you think I should add a new ComputeStatus variant for the "prewarming" case

If there is a state before Empty, this will work.
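
For illustration, a hypothetical shape of that state machine — only Empty is taken from the discussion; Prewarming and the other variant names here are assumptions, not the real ComputeStatus definition:

```rust
/// Hypothetical sketch: a state that precedes `Empty` while the throwaway
/// initdb/postgres cycle is still running (the variant set is an assumption).
#[allow(dead_code)]
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum ComputeStatus {
    Prewarming, // new: node is busy warming up, not bindable yet
    Empty,      // prewarm done, waiting for a spec from the control plane
    Running,
    Failed,
}

/// The control plane should only bind a spec once prewarming is over.
fn can_accept_spec(status: ComputeStatus) -> bool {
    matches!(status, ComputeStatus::Empty)
}

fn main() {
    assert!(!can_accept_spec(ComputeStatus::Prewarming));
    assert!(can_accept_spec(ComputeStatus::Empty));
}
```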

> or maybe avoid taking HTTP requests until we're done prewarming?

That's the easiest one, I guess: just don't start the HTTP server until prewarming is finished.
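
A minimal sketch of that option (the function names and the port are placeholders, not the actual compute_ctl code): the listener doesn't bind until prewarming has finished, so the control plane simply can't reach the node before then.

```rust
use std::net::TcpListener;

/// Placeholder for the prewarm routine sketched under "Summary of changes".
fn prewarm_postgres() -> std::io::Result<()> {
    Ok(())
}

/// Bind and serve only after prewarming: until then, status polls from the
/// control plane fail to connect, so nothing gets bound to this node.
fn serve_http(addr: &str) -> std::io::Result<()> {
    let listener = TcpListener::bind(addr)?;
    for stream in listener.incoming() {
        let _conn = stream?;
        // handle status / spec requests here
    }
    Ok(())
}

fn main() -> std::io::Result<()> {
    prewarm_postgres()?; // blocks until the warm-up cycle is done
    serve_http("0.0.0.0:3080") // port is an assumption
}
```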
