This repository has been archived by the owner on Sep 30, 2024. It is now read-only.
When an UnreachableMasterWithLaggingReplicas is detected on the master, Orchestrator restarts both the SQL thread and the I/O thread as part of its emergent action.
We noticed that stopping and starting the SQL thread on the replicas increases replication lag, because the transaction being applied has to be rolled back and re-applied from the start.
This behavior was introduced in #1010, which restarts the SQL thread as well; we believe that is what causes this issue.
Orchestrator would still be able to detect the "Too Many Connections" issue even if only the I/O thread of the replica is restarted.
@shlomi-noach As discussed, I will submit a PR reverting the code to restart only the I/O thread.
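
To illustrate the difference between the two restart modes, here is a rough sketch in Go. It is not Orchestrator's actual code: `restartStatements` and its `ioThreadOnly` parameter are hypothetical names used only for this example. It simply lists the MySQL statements each mode would issue on a replica. Restarting only the I/O thread leaves the SQL thread running, so the transaction currently being applied is not rolled back.

```go
package main

import "fmt"

// restartStatements is a hypothetical helper (not part of Orchestrator).
// It returns the MySQL statements a replica restart would issue.
// With ioThreadOnly set, the SQL thread is left untouched, so the
// in-flight transaction keeps being applied.
func restartStatements(ioThreadOnly bool) []string {
	if ioThreadOnly {
		return []string{
			"STOP SLAVE IO_THREAD",
			"START SLAVE IO_THREAD",
		}
	}
	// Plain STOP SLAVE / START SLAVE act on both the I/O and SQL threads.
	return []string{
		"STOP SLAVE",
		"START SLAVE",
	}
}

func main() {
	// Print the statements for the proposed I/O-thread-only restart.
	for _, stmt := range restartStatements(true) {
		fmt.Println(stmt)
	}
}
```

The I/O-thread-only variant still forces the replica to reconnect to the master, which is the part that matters for noticing connection problems.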
gsraman changed the title from "RestartReplicationQuick causing increase in replication delay" to "RestartReplicationQuick causing increase in replication lag" on Feb 16, 2021