This repository has been archived by the owner on Mar 28, 2020. It is now read-only.

e2e flake: etcd pod failed on bootstrap #1300

Closed
hongchaodeng opened this issue Jul 16, 2017 · 3 comments

@hongchaodeng
Member

link: https://jenkins-etcd.prod.coreos.systems/job/etcd-operator-master/558/console

The operator added member "test-etcd-jf0xc-0001", but the pod later disappeared.

From logs of pod "test-etcd-jf0xc-0001":

etcdserver: could not get cluster response from http://test-etcd-jf0xc-0000.test-etcd-jf0xc.e2e-etcd-operator-master-558.svc.cluster.local:2380: Get http://test-etcd-jf0xc-0000.test-etcd-jf0xc.e2e-etcd-operator-master-558.svc.cluster.local:2380/members: dial tcp: i/o timeout
etcdmain: cannot fetch cluster info from peer urls: could not retrieve cluster information from the given urls
@hongchaodeng
Member Author

hongchaodeng commented Sep 8, 2017

This timeout happens only about 1s after bootstrap. 1s is too short.

2017-09-08 18:16:57.071859 I | embed: listening for client requests on 0.0.0.0:2379
2017-09-08 18:16:58.276640 W | etcdserver: could not get cluster response from http://test-etcd-w7s0f-0000.test-etcd-w7s0f.e2e-etcd-operator-2264.svc:2380: Get http://test-etcd-w7s0f-0000.test-etcd-w7s0f.e2e-etcd-operator-2264.svc:2380/members: dial tcp: i/o timeout
2017-09-08 18:16:58.336459 C | etcdmain: cannot fetch cluster info from peer urls: could not retrieve cluster information from the given urls

@hongchaodeng
Member Author

cc @fanminshi

@hongchaodeng
Member Author

This has been fixed since we added a check in the init container.
