Reduce the impact of etcd leader on the availability of PD leader #7499

JmPotato · 2023-12-06T07:21:15Z

We have met several cases showing that even if the PD leader can no longer provide services, the etcd leader does not switch, resulting in the entire cluster being unable to provide services and affecting the high availability of PD, ref #7251, pingcap/tidb#48204 and pingcap/tidb#48206.

Because our implementation adopts the design of a PD leader elected based on the etcd leader, we need to determine more clearly whether the current unavailable scenario requires switching the etcd leader rather than just the PD leader to achieve the higher availability.

…export (#7501) ref #7499 Provide methods to read and write `campaignTimes` instead of export. Signed-off-by: JmPotato <ghzpotato@gmail.com>

#7725) ref #7499 Sort out the initialization functions of etcd client. Signed-off-by: JmPotato <ghzpotato@gmail.com>

ref #7499 Refine the etcd client healthy checker code. Signed-off-by: JmPotato <ghzpotato@gmail.com>

ref #7499, ref #7730 Return the originally picked endpoints directly if all are evicted to gain better availability. Signed-off-by: JmPotato <ghzpotato@gmail.com>

ref #7499 member: reset campaign times after successful resign Signed-off-by: husharp <jinhao.hu@pingcap.com>

JmPotato · 2024-02-06T08:48:09Z

Close with #7737.

JmPotato added the type/enhancement The issue or PR belongs to an enhancement. label Dec 6, 2023

JmPotato mentioned this issue Dec 6, 2023

election: provide methods to read and write campaignTimes instead of export #7501

Merged

ti-chi-bot bot pushed a commit that referenced this issue Dec 7, 2023

election: provide methods to read and write campaignTimes instead of …

6080557

…export (#7501) ref #7499 Provide methods to read and write `campaignTimes` instead of export. Signed-off-by: JmPotato <ghzpotato@gmail.com>

JmPotato mentioned this issue Jan 4, 2024

member, server: randomly check the etcd leader health to proactively resign #7661

Closed

HuSharp mentioned this issue Jan 5, 2024

lease: check etcd leader healthy by KeepAliveOnce #7670

Closed

JmPotato mentioned this issue Jan 17, 2024

etcdutil, server: sort out the initialization functions of etcd client #7725

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jan 17, 2024

etcdutil, server: sort out the initialization functions of etcd client (

aa9c83c

#7725) ref #7499 Sort out the initialization functions of etcd client. Signed-off-by: JmPotato <ghzpotato@gmail.com>

JmPotato mentioned this issue Jan 17, 2024

etcdutil: refine the etcd client healthy checker code #7727

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jan 17, 2024

etcdutil: refine the etcd client healthy checker code (#7727)

8f4f81f

ref #7499 Refine the etcd client healthy checker code. Signed-off-by: JmPotato <ghzpotato@gmail.com>

HuSharp mentioned this issue Jan 18, 2024

etcdutil: remove client when etcd server is unhealthy #7729

Closed

This was referenced Jan 18, 2024

Enhance the detection mechanism for the unhealthy etcd node #7730

Closed

etcdutil: consider the latency while patrolling the healthy endpoints #7737

Merged

etcdutil: move health checker into a separate file #7743

Merged

JmPotato mentioned this issue Jan 30, 2024

etcdutil: return original endpoints when all are evicted #7779

Merged

HuSharp mentioned this issue Feb 2, 2024

member: reset campaign times after successful resign #7795

Merged

ti-chi-bot bot pushed a commit that referenced this issue Feb 2, 2024

member: reset campaign times after successful resign (#7795)

54ffd34

ref #7499 member: reset campaign times after successful resign Signed-off-by: husharp <jinhao.hu@pingcap.com>

JmPotato closed this as completed Feb 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce the impact of etcd leader on the availability of PD leader #7499

Reduce the impact of etcd leader on the availability of PD leader #7499

JmPotato commented Dec 6, 2023

JmPotato commented Feb 6, 2024

Reduce the impact of etcd leader on the availability of PD leader #7499

Reduce the impact of etcd leader on the availability of PD leader #7499

Comments

JmPotato commented Dec 6, 2023

JmPotato commented Feb 6, 2024