confd bypasses local proxy, all instances connect directly to single cluster member #319

fasaxc · 2015-08-13T09:37:05Z

I'm doing some scale testing of my application with a 5-node etcd cluster and 500+ client hosts. I have an etcd proxy on each of my client hosts. confd seems to initially connect to the local proxy but then, on subsequent requests, it connects directly to a node in the cluster. Worse, all the confd instances seem to pick the same cluster node to talk to, which causes excessive load on that node and makes it unresponsive.

# /confd --version
confd 0.9.0

The text was updated successfully, but these errors were encountered:

bacongobbler · 2015-08-13T15:20:55Z

sounds like an issue with go-etcd, as that is the library we are using to connect.

xiang90 · 2015-08-13T18:23:19Z

try not to sync the cluster when you know you are reaching out a proxy.

Lukasa · 2015-08-18T10:28:01Z

As @xiang90 says, I think this is strictly an issue with the way confd uses go-etcd. On L31 of backends/etcd/client.go confd calls Client.SetCluster. This, behind the scenes, syncs the cluster.

This call causes go-etcd to query the etcd cluster and find all the machines in the cluster, and then sets the cluster information to that. I think simply removing this call would resolve the problem.

I'm going to try playing around with that change, to see what happens.

Lukasa · 2015-08-25T07:47:00Z

Ok, that appears to work, but we now encounter a problem when passing an authority without a scheme (i.e. just 'host:port', such as '127.0.0.1:4001'). Previously, the call to SetCluster would call go-etcd's Client.createHttpPath method, which can add a scheme to the front. This now doesn't happen. @kelseyhightower any objections to me bringing that method into confd so that I can use it here?

geku · 2016-01-26T11:35:34Z

For us this problem prevents an etcd cluster migration to new node. We have an etcd cluster and etcd-proxies, when etcd nodes get migrated to new nodes, the proxies still work and connect to the new etcd peers but confd stops working as it somehow misses the new etcd peers and returns a ERROR 501: All the given peers are not reachable.

In case the etcd would trust the proxy and not try to discover the peers itself, it would work.

HeavyHorst · 2016-02-21T13:31:38Z

This should be fixed in the latest master because of the switch to the new etcd client libs.

kelseyhightower · 2016-02-23T05:25:52Z

This should now be fixed on master now that we are using the new etcd client libs.

fasaxc mentioned this issue Aug 13, 2015

Cluster destabilises, one node using 100% CPU, others OK etcd-io/etcd#3261

Closed

Lukasa mentioned this issue Aug 25, 2015

Allow proper use of etcd proxies. #329

Closed

bacongobbler mentioned this issue Nov 10, 2015

confd fails talking to etcd in proxy mode #354

Closed

kelseyhightower closed this as completed Feb 23, 2016

laurrentt mentioned this issue Mar 30, 2016

chore(*): bump confd to v0.11.0 deis/deis#4888

Closed

ghost mentioned this issue Jun 20, 2016

etcd backend proxy mode #456

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

confd bypasses local proxy, all instances connect directly to single cluster member #319

confd bypasses local proxy, all instances connect directly to single cluster member #319

fasaxc commented Aug 13, 2015

bacongobbler commented Aug 13, 2015

xiang90 commented Aug 13, 2015

Lukasa commented Aug 18, 2015

Lukasa commented Aug 25, 2015

geku commented Jan 26, 2016

HeavyHorst commented Feb 21, 2016

kelseyhightower commented Feb 23, 2016

confd bypasses local proxy, all instances connect directly to single cluster member #319

confd bypasses local proxy, all instances connect directly to single cluster member #319

Comments

fasaxc commented Aug 13, 2015

bacongobbler commented Aug 13, 2015

xiang90 commented Aug 13, 2015

Lukasa commented Aug 18, 2015

Lukasa commented Aug 25, 2015

geku commented Jan 26, 2016

HeavyHorst commented Feb 21, 2016

kelseyhightower commented Feb 23, 2016