[SPARK-37060][CORE] Handle driver status response from backup masters #34331
Can one of the admins verify this patch?
I don't know enough to review this. Can you CC the author of the change before this?
Could someone please review these changes? CC: @cloud-fan @Ngone51 @HeartSaVioR @jiangxb1987
@mohamadrezarostami you may need to rebase your branch to pass GA.
…sponse from backup masters
Done!
Thanks, merged to master. @mohamadrezarostami Could you create PRs to backport this to branch-3.2/branch-3.1?
@Ngone51 |
What changes were proposed in this pull request?
After the improvement in SPARK-31486, the client uses the 'asyncSendToMasterAndForwardReply' method instead of 'activeMasterEndpoint.askSync' to get the status of the driver. Because the driver's status is only available on the active master, and 'asyncSendToMasterAndForwardReply' iterates over all of the masters, the client has to handle the responses from the backup masters, which the SPARK-31486 change did not consider. As a result, drivers running in cluster mode on a cluster with multiple masters are affected by this bug.
Why are the changes needed?
The client needs to detect whether a response was received from a backup master and, if so, ignore it.
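The idea of ignoring backup-master replies can be illustrated with a minimal, self-contained sketch. It is not the code from this patch: `MasterState`, `StatusReply`, and `DriverStatusSketch` are hypothetical names, and the assumption that each reply carries the state of the master that sent it is made only for illustration. The PR description does not say how the client tells a backup reply from an active one.

```scala
// Hypothetical, simplified model for illustration; these are not Spark's classes.
sealed trait MasterState
case object Alive extends MasterState
case object Standby extends MasterState

// Assumption: every reply is tagged with the state of the master that sent it.
final case class StatusReply(
    masterState: MasterState,
    found: Boolean,
    driverState: Option[String])

object DriverStatusSketch {
  // Keep only the reply from the active master. Replies from standby
  // (backup) masters carry no driver status and are skipped.
  def resolve(replies: Seq[StatusReply]): Option[StatusReply] =
    replies.find(_.masterState == Alive)

  // Turn the collected replies into a human-readable status line.
  def describe(replies: Seq[StatusReply]): String =
    resolve(replies) match {
      case Some(StatusReply(_, true, Some(state))) => s"driver is $state"
      case Some(_) => "driver not found on active master"
      case None => "no active master replied yet"
    }
}
```

With one standby reply followed by one active reply, `describe` reports the state from the active master instead of treating the standby's empty reply as "driver not found", which is the failure mode described in this PR.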
Does this PR introduce any user-facing change?
No. It only fixes a bug and restores the ability to deploy in cluster mode on multi-master clusters.
How was this patch tested?