
Check the node status during leader re-election #24

Closed
kasonglee opened this issue Aug 5, 2020 · 7 comments
Labels
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@kasonglee
Contributor

kasonglee commented Aug 5, 2020

Feature Request

Is your feature request related to a problem? Please describe.

In leader/leader.go, when a worker node fails, leader re-election only happens after the default timeout of 5 minutes, because the current check waits for the condition Pod.Status.Phase == "Failed" && Pod.Status.Reason == "Evicted". In my opinion, leader re-election could happen almost immediately if the check also considered the status of the node where the leader pod is running.

Describe the solution you'd like

Check the condition of the node where the leader pod is running (i.e., whether its NodeReady condition is something other than ConditionTrue). When that node has failed, delete the leader pod (the deletion only marks the pod as terminating, since the node it runs on is down) and the ConfigMap lock whose OwnerReference points to the leader pod.
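
To make this concrete, here is a minimal sketch of that flow, assuming the controller-runtime client. It is not the actual leader.go code; the helper names isNodeNotReady and evictLeaderIfNodeDown are illustrative only.

```go
package leader

import (
	"context"

	corev1 "k8s.io/api/core/v1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// isNodeNotReady reports whether the node hosting the given pod has lost its
// Ready condition (NodeReady != ConditionTrue).
func isNodeNotReady(ctx context.Context, c client.Client, pod *corev1.Pod) (bool, error) {
	node := &corev1.Node{}
	if err := c.Get(ctx, client.ObjectKey{Name: pod.Spec.NodeName}, node); err != nil {
		return false, err
	}
	for _, cond := range node.Status.Conditions {
		if cond.Type == corev1.NodeReady {
			return cond.Status != corev1.ConditionTrue, nil
		}
	}
	// No Ready condition reported at all; treat the node as not ready.
	return true, nil
}

// evictLeaderIfNodeDown deletes the leader pod and the ConfigMap lock it owns
// when the leader's node is not ready, so a new leader can take over without
// waiting for the eviction timeout.
func evictLeaderIfNodeDown(ctx context.Context, c client.Client, leaderPod *corev1.Pod, lock *corev1.ConfigMap) error {
	notReady, err := isNodeNotReady(ctx, c, leaderPod)
	if err != nil || !notReady {
		return err
	}
	// The pod will only be marked as terminating because its node is down,
	// so the ConfigMap lock is deleted explicitly as well.
	if err := c.Delete(ctx, leaderPod); err != nil {
		return err
	}
	return c.Delete(ctx, lock)
}
```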

The screenshots below show a test of the node check for leader re-election (test-operator-xxx-xxxqb was the leader pod).

[Three screenshots attached to the original issue, not reproduced here.]

Shortening --pod-eviction-timeout could be another approach. However, I am confident the approach above is more reliable, since we don't know what timeout value would be appropriate.

Also, are there any drawbacks to making --pod-eviction-timeout very short?

@HyungJune

@mhrivnak
Member

mhrivnak commented Aug 6, 2020

Also, are there any drawbacks to making --pod-eviction-timeout very short?

The use case for the timeout is that a Node gets temporarily disconnected from the cluster but is then able to re-connect. The shorter the timeout, the more risk that the Node will re-appear and all the workloads on it will try to keep running despite having had their Pods deleted. Probably they would not run for long, but it may take some time for the local kubelet to catch up and stop all the containers.

This scenario greatly benefits from fencing. Ideally you allow a separate component to monitor the Node health, and then take action to ensure that a missing Node will not return before deleting its workloads. This can be done for example by powering off the Node's underlying machine.

@kasonglee
Contributor Author

Here's what I understood: making --pod-eviction-timeout very short can be risky because it may take some time for the local kubelet to catch up and stop all the containers. The solution is a separate component that monitors node health and ensures the missing node will not return to Ready status before its workloads are deleted.

I think that, as in my case, an operator may need a node-health check in order to get fast leader re-election. So I suggest adding a separate Go package for checking node health to the operator-lib repository; the package could then be used by operators that run on all the nodes in the cluster (i.e., as a DaemonSet). (Of course, I would need to modify my code to delete all workloads on the missing node.)
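
Purely as an illustration (the package name nodehealth and the function IsReady below are hypothetical, not an existing operator-lib API), such a shared helper might be as small as:

```go
// Package nodehealth is a hypothetical helper package; the name and API are
// illustrative only, not part of operator-lib today.
package nodehealth

import (
	"context"

	corev1 "k8s.io/api/core/v1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// IsReady reports whether the named node currently has NodeReady == ConditionTrue.
// An operator running as a DaemonSet could call this for its own node (or for
// the leader's node) before deciding to clean up workloads.
func IsReady(ctx context.Context, c client.Client, nodeName string) (bool, error) {
	node := &corev1.Node{}
	if err := c.Get(ctx, client.ObjectKey{Name: nodeName}, node); err != nil {
		return false, err
	}
	for _, cond := range node.Status.Conditions {
		if cond.Type == corev1.NodeReady {
			return cond.Status == corev1.ConditionTrue, nil
		}
	}
	return false, nil
}
```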

openshift-merge-robot pushed a commit that referenced this issue Oct 10, 2020
* leader: check the node status for the leader-election (#24)

* leader: enhance the coverage

Co-authored-by: kasong_lee <kasong_lee@tmax.co.kr>
@openshift-bot

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci-robot openshift-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 5, 2020
@openshift-bot

Stale issues rot after 30d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci-robot openshift-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Dec 5, 2020
@Bryce-huang

Is there any solution?

@Bryce-huang

@varshaprasad96
Member

varshaprasad96 commented Jan 11, 2021

There is still discussion needed on modifying the controller-runtime interface to allow choosing between leader-for-life and lease-based leader election. Since this particular issue concerns waiting for the default timeout even when the node status is not Ready, and a PR has already been merged to solve it, I am closing this issue. Please feel free to open another issue if you would like any other modifications to the current (leader-for-life) implementation.
