Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

externalTrafficPolicy of tidb service should default to local in terraform deployment #879

Closed
aylei opened this issue Sep 6, 2019 · 4 comments
Labels
area/terraform good first issue Good for newcomers lifecycle/stale status/help-wanted Extra attention is needed type/bug Something isn't working

Comments

@aylei
Copy link
Contributor

aylei commented Sep 6, 2019

Bug Report

When the externalTrafficPolicy is set to cluster (the default value), every Kubernetes node is a valid backend for the load balancer, a single machine failure will impact part of new connections. More seriously, for immature clients that has a 0 connect-timeout (no timeout), the new connections that routed to the failure machine hangs forever.

With externalTrafficPolicy set to local, the nodes without backend pods route the traffic to blackhole, therefore, LB can mark these nodes as unhealthy when performs the initial check. local mode may cause unbalanced traffic, but not for our terraform deployment which uses dedicated node.

What did you do?

  1. Deploy TiDB cluster via the AWS terraform scripts
  2. Simulate inserts using mysqlcli
  3. Terminate a PD instance from the AWS console

What did you expect to see?
No impact on the workload

What did you see instead?
mysqlcli hangs forever

cc @tennix @weekface

ref:
https://kubernetes.io/docs/tasks/access-application-cluster/create-external-load-balancer/
https://www.asykim.com/blog/deep-dive-into-kubernetes-external-traffic-policies

@aylei aylei added type/bug Something isn't working area/terraform good first issue Good for newcomers status/help-wanted Extra attention is needed labels Sep 6, 2019
@sokada1221
Copy link
Contributor

I'll quickly fix this one as it's related to our use case.

@github-actions
Copy link

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 15 days

@sokada1221
Copy link
Contributor

@aylei i can't close the ticket - can you?

@aylei
Copy link
Contributor Author

aylei commented Feb 27, 2020

Yes.
closed via #947

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/terraform good first issue Good for newcomers lifecycle/stale status/help-wanted Extra attention is needed type/bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants