Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

etcd shutdown failed due to grpc: addrConn.createTransport connection error #11208

Closed
s2li opened this issue Oct 8, 2019 · 4 comments
Closed
Labels

Comments

@s2li
Copy link

s2li commented Oct 8, 2019

Specific:
etcd Version: 3.3.0+git
Git SHA: Not provided (use ./build instead of go build)
Go Version: go1.12.5
Go OS/Arch: linux/amd64

but we use clientV2.

etcd cluster deployment

issue was happened on single node shutdown phase.
loop print addrConn.createTransport connection error, and etcd server process can't exit after received sigterm.

logs:
Sep 25 14:08:28 node1 etcd[2864]: received terminated signal, shutting down...
Sep 25 14:08:28 node1 etcd-start-up.sh[2864]: WARNING: 2019/09/25 14:08:28 grpc:
addrConn.createTransport failed to connect to {127.0.0.1:2379 0 }. Err :connection error:
desc = "transport: Error while dialing dial tcp 127.0.0.1:2379: connect: connection refused".
Reconnecting...
...
Sep 25 14:09:14 node1 etcd-start-up.sh[2864]: WARNING: 2019/09/25 14:09:14 grpc:
addrConn.createTransport failed to connect to {127.0.0.1:2379 0 }. Err :connection error:
desc = "transport: Error while dialing dial tcp 127.0.0.1:2379: connect: connection refused".
Reconnecting...
-- Reboot --

Reproducible:
yes, but low, observe 2+ cases by reboot command.

I searched history issues, and found it's similar with #8267, could you confirm whether same one or a new case? thanks!

what we do / WR for this issue:
we currently set TimeoutStopSec to low value to speed etcd service exit by sigkill.

@soniadimitrov
Copy link

Also seeing with same with 3.3.18, the only way to recover is to sigkill the etcd process:

2020-01-30 18:21:33.629070 I | etcdmain: etcd Version: 3.3.18
2020-01-30 18:21:33.629117 I | etcdmain: Git SHA: 3c8740a
2020-01-30 18:21:33.629122 I | etcdmain: Go Version: go1.13.4
2020-01-30 18:21:33.629126 I | etcdmain: Go OS/Arch: linux/amd64

2020-01-30 18:15:47.785992 N | pkg/osutil: received terminated signal, shutting down...
WARNING: 2020/01/30 18:15:47 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:15:48 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:15:50 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:15:52 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:15:57 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:16:05 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:16:16 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:16:31 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:16:56 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:17:37 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:18:55 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...
WARNING: 2020/01/30 18:20:28 grpc: addrConn.createTransport failed to connect to {72.247.39.22:2379 0 }. Err :connection error: desc = "transport: Error while dialing dial tcp 72.247.39.22:2379: connect: connection refused". Reconnecting...

Is there a fix for this in 3.4?

@fengzixu
Copy link

same problem

@aviam
Copy link

aviam commented Apr 6, 2020

Hi,
Any update? getting the same error when configuring vault with etcd as kv storage.
Any estimation when this will be fixed?

@stale
Copy link

stale bot commented Jul 5, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 21 days if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Jul 5, 2020
@stale stale bot closed this as completed Jul 26, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Development

No branches or pull requests

4 participants