[bitnami/postgresql-repmgr] Restarting all nodes of a 3-node cluster simultaneously results in the cluster failing to start properly. #67372
Comments
For example, suppose there are three nodes, A, B, and C, and C is the master node before they are all shut down. After all three are brought back up at the same time, C is not yet running properly. When B starts, it cannot find the master node, so repmgr runs the postgres process on B directly. When C then starts within this short window, it connects to B's service on port 5432 and learns that the master is C. That address is not filtered out even though it is C itself, so C attempts to connect to itself and fails, resulting in a circular dependency between B and C.
The script only checks whether the primary node is the node itself at https://github.com/bitnami/containers/blob/main/bitnami/postgresql-repmgr/15/debian-12/rootfs/opt/bitnami/scripts/librepmgr.sh#L224, but there is no such check at https://github.com/bitnami/containers/blob/main/bitnami/postgresql-repmgr/15/debian-12/rootfs/opt/bitnami/scripts/librepmgr.sh#L240. When repmgr learns from another node's postgres service that the primary node is the node itself, it retries connecting to its own postgres service, which is not yet running and ready.
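To make the missing check concrete, here is a minimal sketch of the kind of self-filter described above. It is not the code from librepmgr.sh: the function name `filter_reported_primary` and its arguments are hypothetical, and it assumes a node can be identified by comparing host and port strings.

```shell
#!/usr/bin/env bash
# Hypothetical helper, NOT part of librepmgr.sh.
# Arguments: this node's host and port, then the primary host and port
# reported by a partner node.
# Prints "host port" when the reported primary is a different node.
# Prints nothing when the reported primary is this node itself, so the
# caller never retries a connection to its own, not yet ready, service.
filter_reported_primary() {
    local self_host="$1" self_port="$2"
    local reported_host="$3" reported_port="$4"

    if [[ "$reported_host" == "$self_host" && "$reported_port" == "$self_port" ]]; then
        # The partner still believes this node is the primary: ignore the answer.
        return 0
    fi
    printf '%s %s\n' "$reported_host" "$reported_port"
}
```

With this in place, a caller that gets an empty result could treat the partner's answer as "no primary found" and move on to the next partner instead of looping on itself.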
This looks similar to #999.
Thank you for bringing this issue to our attention. We appreciate your involvement! If you're interested in contributing a solution, we welcome you to create a pull request. The Bitnami team is excited to review your submission and offer feedback. You can find the contributing guidelines here. Your contribution will greatly benefit the community. Feel free to reach out if you have any questions or need assistance.
Please take a look at #67370.
Thank you for opening this issue and submitting the associated Pull Request. Our team will review and provide feedback. Once the PR is merged, the issue will automatically close. Your contribution is greatly appreciated!
Feel free to reach out if you have any questions or need assistance.
This Issue has been automatically marked as "stale" because it has not had recent activity (for 15 days). It will be closed if no further activity occurs. Thanks for the feedback. |
Due to the lack of activity in the last 5 days since it was marked as "stale", we proceed to close this Issue. Do not hesitate to reopen it later if necessary. |
Name and Version
bitnami/postgresql-repmgr:15.5.0-debian-11-r15
What architecture are you using?
amd64
What steps will reproduce the bug?
What is the expected behavior?
The postgres cluster should recover to a healthy state.
What do you see instead?
Every pg pod crashes, and the cluster cannot elect a primary.
pg pod exit log
Additional information