-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Liveness Probe on multi-node etcd #280
Comments
Reading this I wonder about the option keeping the current liveness probe but setting |
yes, this make sense to me .... I don’t see any major issue with this.
|
Hi @timuthy ,
|
As discussed, the assumption wasn't that all members are down but rather that if e.g. |
I'm not saying last pod is killed due to liveness probe ... I'm saying last pod can be killed/crash due to some reason, why you are assuming that last pod can’t be killed. |
The point I'm making is, suppose all pods/member of etcd-cluster crashes/goes down due to some reasons, they will not be able to come up due to this livenessProbe: |
I assume that they will come up fast enough, so that the livenessProbes don't fail beyond the expected threshold ( |
yeah exactly but then we have to depend on this configuration and set them accordingly. |
IMO using |
Can you sketch how an implementation of |
Reopening this issue as |
I'd be interested in knowing what is planned here. Do you already have something in mind? |
I reopened it for future reference as whenever we will upgrade our Etcd version(may be to 3.6) then we can also update the livenessProbe for etcd. |
TBH, I don't get the motivation of having an open issue if there is no issue today (as claimed by #280 (comment)). If you want to keep this as a future reference then you can still take the information from this issue, even if it's closed. Either we see a problem with the readiness/liveness probes as they are configured today and discuss/implement a solution or we open a ☂️ issue for upgrading to etcd 3.6 and list all the requirements/changes once we plan to do so. |
ok, then feel free to close. |
Found out the upstream issue which is also described in this comment #280 (comment) |
We had introduced the livenessProbe in multi-node etcd with this PR: #396 /on-hold |
This issue needs to pick after #445 etcd version upgrade. |
/assign @ishan16696 |
Feature (What you would like to be added):
Configure the correct livenessProbe in
multi-node etcd
.Motivation (Why is this needed?):
Right now I found 2 ways to configure the livenessProbe:
/health
endpoint/health
endpoint returnsfalse
if one of the following conditions is met (source):Disadvantage of Method 1 (
/health
endpoint).ETCDCTL_API=3 etcdctl endpoint health --endpoints=${ENDPOINTS} --command-timeout=Xs
etcdctl endpoint health
command performs a GET on the "health" key(source)Disadvantage of Method 2 (
etcdctl endpoint health
).This issue of liveness is already reported on etcd here and it has been fixed by this PR but this feature will come with etcd v3.6 and we don't know the timeline when it will be released.
What options we have:
etcdctl endpoint health
) and whenever etcd release v3.6, switch to that version and configure the correct liveness probe.The text was updated successfully, but these errors were encountered: