Leaderless QQs reporting noproc #12701
-
Community Support Policy
RabbitMQ version used3.13.6 How is RabbitMQ deployed?Debian package Steps to reproduce the behavior in questionHi Team 👋 We've been running a number of test scenarios over the last couple of months to evaluate QQ performance and resiliency (before upgrading some of our client critical CMQs) and sometimes (not often) we encounter leaderless QQs, where leaders become unavailable and are reported as We have been looking at and experimenting with a custom workaround CLI command which (efficiently) checks sanity of leaders, which we're looking to PR for comments and review, and to have available for use in later rabbit releases. Plan is to optionally use this command along with the already provided upgrade and maintenance queue checks when upgrading critical environments. Command signature we have so far (open to other name suggestions 🙏 ):
which spawns concurrent leader checks (with concurrency limit/threshold which we dont want to exceed), where:
We'll follow up with the implementation for further comments. This is to mitigate the major risk factor of |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 7 replies
-
I think the first thing to do is to provide information on how these queues failed so that we can fix that (if still an issue). |
Beta Was this translation helpful? Give feedback.
-
Keep in mind that RabbitMQ 3.13.x is out of community support, so any contributions in this area won't be available in OSS RabbitMQ 3.13.x. |
Beta Was this translation helpful? Give feedback.
-
This should be addressed by #12727 (contributed by @Ayanda-D) and available for testing in alpha |
Beta Was this translation helpful? Give feedback.
This should be addressed by #12727 (contributed by @Ayanda-D) and available for testing in alpha
4.0.4-alpha.0caa0664
.