docs(rfc): Independent compute release flow #8881

ololobus · 2024-08-30T17:22:20Z

Related to https://github.com/neondatabase/cloud/issues/11698

github-actions · 2024-08-30T18:21:06Z

4986 tests run: 4822 passed, 0 failed, 164 skipped (full report)

Flaky tests (9)

Postgres 17

test_pageserver_compaction_smoke: release-x86-64
test_pg_regress[4]: debug-x86-64
test_timeline_archive[4]: debug-x86-64
test_restart_endpoint_after_switch_wal: release-x86-64

Postgres 16

test_slots_and_branching: release-arm64

Postgres 15

test_neon_cli_basics: release-arm64

Postgres 14

test_lfc_resize: release-arm64
test_neon_cli_basics: release-arm64
test_subscriber_restart: release-x86-64

Code coverage* (full report)

functions: 32.1% (7455 of 23225 functions)
lines: 49.9% (60101 of 120333 lines)

* collected from Rust tests only

_{The comment gets automatically updated with the latest test results
1939b99 at 2024-09-24T00:02:32.818Z :recycle:}

hlinnaka

Overall, I like it

docs/rfcs/038-independent-compute-release.md

bayandin

Should we adjust the compute pool logic for the proposed release changes? Probably not, but asking just in case

docs/rfcs/038-independent-compute-release.md

mtyazici

LGTM as well, I am a bit hesitant to the alternative Helm approach though.

docs/rfcs/038-independent-compute-release.md

ololobus · 2024-09-05T18:31:55Z

Should we adjust the compute pool logic for the proposed release changes? Probably not, but asking just in case

For now, we shouldn't. With this v1 release flow it's expected that we will start all new computes in the pool with a new version, while we still re-utilize old pre-created ones, if there is a shortage of new computes. Later, yes, we would need to teach pools to maintain pools with both versions if we want kinda canary deployments within the same region / control plane

docs/rfcs/038-independent-compute-release.md

problame

Overall plus one, especially the compute_releases table will be a huge step forward.

I'm concerned about the ignorance in this PR about merge strategies, see my comments on that. I think it needs a bit more forethought.

Maybe I missed it, but, I think there should be a "Unique Selling Point" section outlining why we want the compute_releases table. I know from our 1:1 discussions, but, it's not written down here. IIRC that was

ability to prewarm pools with new image to avoid misses
- especially important once we do slow rollout strategies where pools will need to maintain pre-warmed computes for both old and new image for the duration of the rollout
extension stuff (I forgot the details, we talked about it 1:1 like 3-4 months ago)

docs/rfcs/038-independent-compute-release.md

ololobus · 2024-09-06T13:26:24Z

especially important once we do slow rollout strategies where pools will need to maintain pre-warmed computes for both old and new image for the duration of the rollout

I didn't want this to be part of v1 because it requires a lot of changes on the control plane side, but I briefly mentioned that in the Further work section https://github.com/neondatabase/neon/pull/8881/files#diff-39df61b534dc5f736661bd5f139140b573a0f35c32524866e0339e425b9f22e4R299

extension stuff

Yeah, I mentioned that too, but waiting for the Anastasia's input here, as I don't know this part well enough yet #8881 (comment). I only know that we need some metadata in cplane and compute to make remote extensions working, but don't know how exactly it's produced

docs/rfcs/038-independent-compute-release.md

ololobus force-pushed the alexk/rfc-compute-release branch from 04575bd to 2d0491b Compare September 2, 2024 19:13

ololobus marked this pull request as ready for review September 2, 2024 19:14

ololobus force-pushed the alexk/rfc-compute-release branch from 2d0491b to 6ff8ff3 Compare September 2, 2024 19:15

ololobus requested review from lubennikovaav, mtyazici, nikitakalyanov and bayandin September 2, 2024 19:17

hlinnaka approved these changes Sep 2, 2024

View reviewed changes

bayandin reviewed Sep 2, 2024

View reviewed changes

docs/rfcs/038-independent-compute-release.md Outdated Show resolved Hide resolved

bayandin approved these changes Sep 2, 2024

View reviewed changes

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

mtyazici approved these changes Sep 3, 2024

View reviewed changes

bayandin reviewed Sep 3, 2024

View reviewed changes

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

ololobus force-pushed the alexk/rfc-compute-release branch 2 times, most recently from 44fb3be to 0ad928d Compare September 5, 2024 18:18

ololobus commented Sep 5, 2024

View reviewed changes

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

problame reviewed Sep 6, 2024

View reviewed changes

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

lubennikovaav reviewed Sep 9, 2024

View reviewed changes

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

docs/rfcs/038-independent-compute-release.md Show resolved Hide resolved

lubennikovaav reviewed Sep 9, 2024

View reviewed changes

docs/rfcs/038-independent-compute-release.md Outdated Show resolved Hide resolved

lubennikovaav mentioned this pull request Sep 23, 2024

Neon supported extensions RFC #9112

Draft

ololobus added 2 commits September 23, 2024 22:25

docs(rfc): Independent compute release flow

a387e39

Review changes

1939b99

ololobus force-pushed the alexk/rfc-compute-release branch from 0ad928d to 1939b99 Compare September 23, 2024 20:25

lubennikovaav approved these changes Sep 24, 2024

View reviewed changes

ololobus merged commit 518f598 into main Sep 25, 2024
79 checks passed

ololobus deleted the alexk/rfc-compute-release branch September 25, 2024 14:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(rfc): Independent compute release flow #8881

docs(rfc): Independent compute release flow #8881

ololobus commented Aug 30, 2024 •

edited

Loading

github-actions bot commented Aug 30, 2024 •

edited

Loading

Postgres 17

Postgres 16

Postgres 15

Postgres 14

hlinnaka left a comment

bayandin left a comment

mtyazici left a comment

ololobus commented Sep 5, 2024 •

edited

Loading

problame left a comment

ololobus commented Sep 6, 2024

docs(rfc): Independent compute release flow #8881

docs(rfc): Independent compute release flow #8881

Conversation

ololobus commented Aug 30, 2024 • edited Loading

github-actions bot commented Aug 30, 2024 • edited Loading

4986 tests run: 4822 passed, 0 failed, 164 skipped (full report)

Postgres 17

Postgres 16

Postgres 15

Postgres 14

Code coverage* (full report)

hlinnaka left a comment

Choose a reason for hiding this comment

bayandin left a comment

Choose a reason for hiding this comment

mtyazici left a comment

Choose a reason for hiding this comment

ololobus commented Sep 5, 2024 • edited Loading

problame left a comment

Choose a reason for hiding this comment

ololobus commented Sep 6, 2024

ololobus commented Aug 30, 2024 •

edited

Loading

github-actions bot commented Aug 30, 2024 •

edited

Loading

ololobus commented Sep 5, 2024 •

edited

Loading