-
Notifications
You must be signed in to change notification settings - Fork 5.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ddl: handle cluster state retrieval failure #53816
Conversation
Hi @tangenta. Thanks for your PR. PRs from untrusted users cannot be marked as trusted with I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #53816 +/- ##
=================================================
- Coverage 72.5705% 55.9252% -16.6454%
=================================================
Files 1506 1627 +121
Lines 431000 609124 +178124
=================================================
+ Hits 312779 340654 +27875
- Misses 98912 245245 +146333
- Partials 19309 23225 +3916
Flags with carried forward coverage won't be shown. Click here to find out more.
|
/retest |
@tangenta: Cannot trigger testing until a trusted user reviews the PR and leaves an In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
@tangenta you can approve yourself |
Author cannot approve his own PR.. |
/approve |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: D3Hunter, lance6716, tangenta, zimulala The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
we should keep retry as long as we're still the owner, like what |
/hold |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rest lgtm
/unhold |
What problem does this PR solve?
Issue Number: close #53808
Problem Summary:
When TiDB is just elected as the DDL owner and
startDispatchLoop
is not started, there maybe a "context canceled" error when owner retired for some reasons(like PD connection issue):What changed and how does it work?
Since
startDispatchLoop
is not only called at TiDB cluster initialization, we should not os.Exit(1).Check List
Tests
Side effects
Documentation
Release note
Please refer to Release Notes Language Style Guide to write a quality release note.