apiserver errors should include reason #50622

discordianfish · 2017-08-14T16:33:27Z

/kind feature

What happened:
I spent too much time figuring out that my apiserver didn't start because the etcd DNS name couldn't be resolved. I got the following error:

Error: error waiting for etcd connection: timed out waiting for the condition

Even --v=10 didn't provide any more context. This was very misleading and made me assume the issue was at the application level.

What you expected to happen:
I expected the apiserver error to tell me that DNS resolution wasn't working.
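
For illustration only: a minimal sketch in Python (not the apiserver's actual Go code) of a wait loop that remembers the last underlying failure and includes it in the timeout error. The names `wait_for_connection` and `check`, and the exact wording of the message, are hypothetical.

```python
import time


def wait_for_connection(check, timeout=1.0, interval=0.05):
    """Call check() until it succeeds or the timeout expires.

    check is a hypothetical callable that raises an exception on failure.
    On timeout, the last underlying error is included in the message.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            check()
            return
        except Exception as err:
            # Remember why the most recent attempt failed.
            last_err = err
        if time.monotonic() >= deadline:
            raise TimeoutError(
                "error waiting for etcd connection: timed out; "
                f"last error: {last_err}"
            ) from last_err
        time.sleep(interval)
```

With a `check` that fails on name resolution, the timeout error would carry the resolver's message instead of only "timed out waiting for the condition".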

How to reproduce it (as minimally and precisely as possible):
I used the bootkube manifest, but just starting the apiserver with --etcd-servers pointing to a nonexistent name should reproduce it.

discordianfish · 2017-08-14T16:35:49Z

@kubernetes/sig-cluster-ops

zhangxiaoyu-zidif · 2017-08-14T21:15:54Z

/area apiserver

xiangpengzhao · 2017-08-17T03:53:25Z

maybe
/sig api-machinery
also.

mml · 2017-08-17T21:09:18Z

/assign @jpbetz cc @mml

k8s-ci-robot · 2017-08-17T21:09:19Z

@mml: GitHub didn't allow me to assign the following users: jpbetz, cc.

Note that only kubernetes members can be assigned.

In response to this:

/assign @jpbetz cc @mml

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Cryptophobia · 2017-08-28T19:34:10Z

@discordianfish : How did you ultimately fix this issue? We are getting the same message when deploying with kops and we think it's a DNS issue, but we keep getting this error in the kube-apiserver.log file.

discordianfish · 2017-08-29T09:52:35Z

@Cryptophobia It's been a "special" setup, but in general you can verify whether it's a DNS issue by using nslookup or dig.
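
A rough programmatic equivalent of the nslookup/dig check, as a minimal sketch: it assumes the value passed to --etcd-servers is a full URL with a scheme, and the function name `check_endpoint_dns` and the example hostname are hypothetical. The `resolver` parameter exists only so the lookup can be swapped out.

```python
import socket
from urllib.parse import urlsplit


def check_endpoint_dns(endpoint, resolver=socket.getaddrinfo):
    """Return (ok, detail) describing whether the endpoint's host resolves."""
    parts = urlsplit(endpoint)
    host = parts.hostname
    if host is None:
        return False, f"no host found in {endpoint!r}"
    try:
        infos = resolver(host, parts.port)
    except socket.gaierror as err:
        # Name resolution failed; report the host and the resolver's reason.
        return False, f"cannot resolve {host!r}: {err}"
    # Each result is (family, type, proto, canonname, sockaddr).
    addresses = sorted({info[4][0] for info in infos})
    return True, f"{host} resolves to {', '.join(addresses)}"
```

Calling `check_endpoint_dns("https://etcd.example.invalid:2379")` on the machine that runs the apiserver would show whether that host can resolve the name at all.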

Cryptophobia · 2017-08-29T20:04:05Z

@discordianfish : It turns out instanceGroups are very important for configuring DNS in Route53 and etcd during cluster configuration with kops. If those instanceGroups are not defined correctly (particularly when there is an even number of masters, or multiple groups of masters in one availability zone in a single region), etcd servers will not start and master nodes will not check in. This is definitely something that is not very well documented in the kops documentation.

discordianfish · 2017-08-30T08:56:52Z

@Cryptophobia Ah, you should file an issue with kops then. This issue here is mostly about saving time by pointing you in the right direction.

Cryptophobia · 2017-08-30T13:43:00Z

Okay, I'll add another issue to the 784 issues already open. 😆 👍

ntfrnzn · 2017-09-27T04:53:22Z

I'm commenting here only to give my +1 to the underlying issue, that the error message emitted by the apiserver is not sufficient to begin debugging related problems.

I'm seeing the same error message, although my experimental configuration is quite different. It would be more pleasing if the apiserver told me clearly why it cannot talk to an etcd cluster, where it thinks that etcd cluster is, and so on.

Cryptophobia · 2017-09-27T14:49:14Z

Agreed. We need better error messages, with more error details propagated up to the API-layer errors, so we know what the Kubernetes apiserver is actually trying to do.

fejta-bot · 2018-01-11T07:26:37Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

discordianfish · 2018-01-11T16:24:15Z

/remove-lifecycle stale

Cryptophobia · 2018-01-11T23:16:25Z

/lifecycle frozen

Cryptophobia · 2018-01-11T23:17:07Z

This would be a really nice feature. I wish I knew more Go so I could help out.

jordy25519 · 2018-03-07T23:08:42Z

I've opened a PR for this. Could use a review

nikhita · 2018-06-13T15:12:20Z

I've opened a PR for this.

@Holygits thanks!!

I'll remove the help-wanted label on this. :)

/remove-help

aaronchall · 2024-02-07T17:44:49Z

Will this issue ever be addressed?

k8s-ci-robot added the kind/feature Categorizes issue or PR as related to a new feature. label Aug 14, 2017

k8s-github-robot added the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Aug 14, 2017

k8s-ci-robot added the area/api Indicates an issue on api area. label Aug 14, 2017

k8s-ci-robot added the area/apiserver label Aug 14, 2017

k8s-ci-robot added the sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. label Aug 17, 2017

k8s-github-robot removed the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Aug 17, 2017

k8s-ci-robot assigned mml Aug 17, 2017

Cryptophobia mentioned this issue Aug 30, 2017

Better documentation for InstanceGroups kubernetes/kops#3316

Closed

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 11, 2018

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 11, 2018

k8s-ci-robot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Jan 11, 2018

discordianfish mentioned this issue Jan 18, 2018

Kubeadm HA ( high availability ) checklist kubernetes/kubeadm#261

Closed

10 tasks

sttts added the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label Mar 1, 2018

jordy25519 mentioned this issue Mar 6, 2018

Propogate etcd preflight network errors upstream #50622 [WIP] #60829

Closed

k8s-ci-robot removed the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label Jun 13, 2018

mml removed their assignment Nov 27, 2023
