
Operator changes for local disk clusters #6942

Merged
12 commits merged into redpanda-data:dev from decommission on Nov 15, 2022

Conversation

joejulian
Contributor

Cover letter

To support local disk, we need to use the automatic node id and automatic bootstrap, among other things. This is the start of that change.
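
For context, a minimal Go sketch (a hypothetical helper, not code from this PR) of what "automatic node id" and "automatic bootstrap" look like in a broker's `redpanda.yaml`: `node_id` is simply left unset so the broker is assigned an ID on first start, and every broker shares the same `seed_servers` list with `empty_seed_starts_cluster: false`.

```go
// Illustration only, not code from this PR: the shape of the "redpanda"
// section the operator would render for automatic node IDs and automatic
// bootstrap. node_id is left out so the broker is assigned an ID on first
// start, and every broker gets the same seed_servers list.
package example

// autoBootstrapNodeConfig (hypothetical helper) builds the broker config as a
// generic map, the way it would be serialized into redpanda.yaml.
func autoBootstrapNodeConfig(seedHosts []string, rpcPort int) map[string]any {
	seeds := make([]map[string]any, 0, len(seedHosts))
	for _, host := range seedHosts {
		seeds = append(seeds, map[string]any{
			"host": map[string]any{"address": host, "port": rpcPort},
		})
	}
	return map[string]any{
		// no "node_id" key: Redpanda assigns one automatically
		"empty_seed_starts_cluster": false, // bootstrap is driven by seed_servers
		"seed_servers":              seeds,
	}
}
```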

Each additional body of work will be in its own commit, to make review easier.

Fixes #ISSUE-NUMBER, Fixes #ISSUE-NUMBER, ...

Backport Required

  • not a bug fix
  • issue does not exist in previous branches
  • papercut/not impactful enough to backport
  • v22.2.x
  • v22.1.x
  • v21.11.x

UX changes

Describe in plain language how this PR affects an end user. What topic flags, configuration flags, command-line flags, deprecation policies, etc. are added or changed?

Release notes

joejulian force-pushed the decommission branch 6 times, most recently from 4ce5b9e to d3eaa14 on November 12, 2022 07:30
RafalKorepta merged commit e51d5b7 into redpanda-data:dev on Nov 15, 2022
@RafalKorepta
Contributor

/backport v22.3.x

@BenPope
Member

BenPope commented Nov 16, 2022

@joejulian Please fill out the backport section of the cover letter

RafalKorepta added a commit to redpanda-data/redpanda-operator that referenced this pull request Jun 19, 2024
The previous use of finalizer handlers was unreliable when a Kubernetes Node's Ready status flipped. Local SSD disks attached to a Redpanda Pod prevent rescheduling, because the Persistent Volume's node affinity binds the Pod to a single Node. By the time such a Node came back to life, the Cluster controller could already have deleted the Redpanda data (PVC deletion and Redpanda decommissioning). If that Redpanda node hosted a single-replica partition, the data would be lost.

If the majority of Redpanda processes ran on unstable Kubernetes Nodes, the Redpanda operator could break the whole cluster by losing Raft quorum.

Reference

#112
redpanda-data/redpanda#6942
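A minimal Go sketch of the kind of readiness guard implied by that commit message (hypothetical names and grace period, not the operator's actual code): only release a decommissioned broker's storage once its Node has been gone or NotReady for longer than a grace period, so a Node that briefly flips Ready status does not trigger PVC deletion and data loss.

```go
// Sketch only: guard storage cleanup against transient Node readiness flips.
// safeToReleaseStorage and unreachableGracePeriod are hypothetical names.
package example

import (
	"context"
	"time"

	corev1 "k8s.io/api/core/v1"
	apierrors "k8s.io/apimachinery/pkg/api/errors"
	"k8s.io/apimachinery/pkg/types"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// unreachableGracePeriod is an assumed threshold, not an operator default.
const unreachableGracePeriod = 10 * time.Minute

// safeToReleaseStorage reports whether the PVC pinned to nodeName can be
// deleted without risking data loss from a transient Node outage.
func safeToReleaseStorage(ctx context.Context, c client.Client, nodeName string) (bool, error) {
	var node corev1.Node
	err := c.Get(ctx, types.NamespacedName{Name: nodeName}, &node)
	if apierrors.IsNotFound(err) {
		// The Node object is gone from the API server; it will not come back.
		return true, nil
	}
	if err != nil {
		return false, err
	}
	for _, cond := range node.Status.Conditions {
		if cond.Type != corev1.NodeReady {
			continue
		}
		if cond.Status == corev1.ConditionTrue {
			// Node is healthy again: keep the PVC and the broker's data.
			return false, nil
		}
		// NotReady or Unknown: release only after the condition has persisted
		// past the grace period, not on the first flip.
		return time.Since(cond.LastTransitionTime.Time) > unreachableGracePeriod, nil
	}
	// No Ready condition reported yet; be conservative and keep the data.
	return false, nil
}
```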