-
Notifications
You must be signed in to change notification settings - Fork 937
Slaves with broken replication should not be shown by which-cluster-osc-replicas #588
Comments
Thank you for the PR and I apologize for this late response, I missed this issue. There is some risk in only providing "good" replicas. Isn't there purpose to returning bad replicas, too? |
friendly ping |
Just to clarify the reasoning, imagine this is an OSC to free space on a fragmented table which would save the slaves that are still healthy, whilst some had already met their fate of "No space left on device". I certainly would not want a broken slave to be presented to me for this to be used as a control slave. Similarly, if The cleanest and simplest solution to the real issue (not knowing a potential control slave is broken from the command) is something like:
This could then be used in a number of ways, including selective filtering for the control slave. It also moves the logic of |
@cezmunsta Allow me to make things simpler even more, and add a |
The status of the slaves do not appear to be checked ahead of being suggested as candidate controllers for a online schema change (OSC).
The logs contain a message showing that
orchestrator
is fully aware of the issue:The following relates to a 2-node cluster where the slave is no longer replicating.
When requesting
which-cluster-osc-replicas
the slave is returned:Here is the live replication status:
Only healthy slaves would be expected to be suggested as candidate controllers for an OSC.
The text was updated successfully, but these errors were encountered: