KCP: ensure machine that is being deleted during upgrades or scale down isn't the etcd leader #2398

Closed
randomvariable opened this issue Feb 21, 2020 · 11 comments · Fixed by #2525
Labels: area/control-plane, help wanted, kind/feature, priority/important-soon

Comments

@randomvariable
Member

Detailed Description

For correctness, the control plane controller should ensure etcd leadership is on a machine that isn't about to be deleted. Two optimisations are possible:

  • Where there are at least two candidates for deletion, pick a machine that is not the etcd leader.
  • Where there is only one candidate, move etcd leadership to a different machine before deleting it.
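
The two cases above can be sketched as a small selection function. This is only an illustration, not the controller's actual code: the `Machine` type, its fields, and `selectForDeletion` are hypothetical names, and the etcd leader's member ID is assumed to have been looked up already.

```go
package main

// Machine is a hypothetical, simplified view of a control plane machine
// and the etcd member running on it.
type Machine struct {
	Name         string
	EtcdMemberID uint64
}

// selectForDeletion picks a machine from the deletion candidates.
// It returns the chosen machine, whether etcd leadership must be moved
// away from it before it is deleted, and false if there are no candidates.
func selectForDeletion(candidates []Machine, leaderID uint64) (Machine, bool, bool) {
	if len(candidates) == 0 {
		return Machine{}, false, false
	}
	// Case 1: prefer any candidate that is not the etcd leader.
	for _, m := range candidates {
		if m.EtcdMemberID != leaderID {
			return m, false, true
		}
	}
	// Case 2: the only candidate hosts the leader, so leadership
	// has to move to another machine first.
	return candidates[0], true, true
}
```

With candidates `cp-a` (the leader) and `cp-b`, the function returns `cp-b` with no leadership move; with `cp-a` alone, it returns `cp-a` and signals that leadership must be moved first.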

/kind feature
/help

@k8s-ci-robot
Contributor

@randomvariable:
This request has been marked as needing help from a contributor.

Please ensure the request meets the requirements listed here.

If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-help command.

@k8s-ci-robot added the kind/feature and help wanted labels Feb 21, 2020
@detiber
Member

detiber commented Feb 21, 2020

I would argue that we should probably always handle this by moving the leadership, otherwise we risk causing conflicts with the failure domain handling.

@vincepri
Member

In terms of milestone, when do we want to tackle this?

@detiber
Member

detiber commented Feb 21, 2020

I think we still want to attempt to get this in prior to v0.3.0.
/milestone v0.3.0

@k8s-ci-robot added this to the v0.3.0 milestone Feb 21, 2020
@vincepri
Member

/priority important-soon

@k8s-ci-robot added the priority/important-soon label Feb 21, 2020
@alexander-demicev
Contributor

/assign @alexander-demichev

@vincepri
Member

vincepri commented Mar 2, 2020

Are we planning to do this for v0.3.0 or should we punt this to v0.3.x?

@chuckha
Contributor

chuckha commented Mar 4, 2020

Let's leave it in v0.3.0, and if it looks like it will merge today I'll put it in the RC milestone.

@michaelgugino
Contributor

We need to update the first comment or title to reflect that this is specifically about upgrades.

@vincepri
Member

vincepri commented Mar 5, 2020

/retitle KCP: ensure machine that is being deleted during upgrades isn't the etcd leader

@k8s-ci-robot changed the title from "kubeadmcontrolplane - ensure machine that is being deleted isn't the etcd leader" to "KCP: ensure machine that is being deleted during upgrades isn't the etcd leader" Mar 5, 2020
@detiber changed the title from "KCP: ensure machine that is being deleted during upgrades isn't the etcd leader" to "KCP: ensure machine that is being deleted during upgrades or scale down isn't the etcd leader" Mar 5, 2020
@vincepri
Member

vincepri commented Mar 9, 2020

/milestone v0.3.x

@k8s-ci-robot modified the milestones: v0.3.0, v0.3.x Mar 9, 2020
@vincepri added the area/control-plane label Mar 23, 2020

7 participants