Fix firewall to allow etcd client traffic between controllers #287

dghubble · 2018-08-22T06:44:06Z

Broaden internal-etcd firewall rule to allow etcd client traffic (2379) from other controller nodes
Previously, kube-apiservers were only able to connect to their node's local etcd peer. While master node outages were tolerated, reaching a healthy peer took longer than necessary in some cases
Reduce time needed to bootstrap a cluster

dghubble · 2018-08-22T06:49:19Z

This was most evident from running kubectl get cs (ignore scheduler and controller manager) several times on a GCP cluster. Notice, whichever apiserver handles the request, you'll see it views its own etcd peer as healthy and the others as unreachable. In reality, each etcd peer in the 3-etcd cluster is fine.

etcd-0               Healthy     {"health":"true"}                                                                                                                                            
etcd-2               Unhealthy   Get https://yavin-etcd2.domain.com.:2379/health: net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)   
etcd-1               Unhealthy   Get https://yavin-etcd1.domain.com.:2379/health: net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)

With the fix, the results align with other multi-master clusters.

etcd-0               Healthy     {"health":"true"}                                                                                                                                            
etcd-1               Healthy     {"health":"true"}                                                                                                                                            
etcd-2               Healthy     {"health":"true"}

Note, you can still shutdown masters if a quorum of nodes are up and expect kubectl to work. So this doesn't directly impact availability, but was definitely undesired.

* Broaden internal-etcd firewall rule to allow etcd client traffic (2379) from other controller nodes * Previously, kube-apiservers were only able to connect to their node's local etcd peer. While master node outages were tolerated, reaching a healthy peer took longer than neccessary in some cases * Reduce time needed to bootstrap a cluster

dghubble added kind/bug platform/google-cloud labels Aug 22, 2018

dghubble force-pushed the fix-etcd-firewall branch from 0ecaabf to e58b424 Compare August 22, 2018 06:51

dghubble merged commit e58b424 into master Aug 22, 2018

dghubble deleted the fix-etcd-firewall branch September 3, 2018 18:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix firewall to allow etcd client traffic between controllers #287

Fix firewall to allow etcd client traffic between controllers #287

dghubble commented Aug 22, 2018 •

edited

Loading

dghubble commented Aug 22, 2018 •

edited

Loading

Fix firewall to allow etcd client traffic between controllers #287

Fix firewall to allow etcd client traffic between controllers #287

Conversation

dghubble commented Aug 22, 2018 • edited Loading

dghubble commented Aug 22, 2018 • edited Loading

dghubble commented Aug 22, 2018 •

edited

Loading

dghubble commented Aug 22, 2018 •

edited

Loading