-
Notifications
You must be signed in to change notification settings - Fork 297
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update from 4.11.0-0.okd-2023-01-14-152430 to 4.12.0-0.okd-2023-02-18-033438 failing #1527
Comments
Kubelet seems to be unavailable on both nodes, so the must-gather does not contain much logs from the nodes. Does
|
thanks for taking a look @melledouwsma Yeah, the kublet isn't running due to the absence of
|
Check logs on the node for "nm-dispatcher" - this would have logs from 30-resolv-prepender |
looks like maybe some more selinux gremlins?
Applying the workaround from #1425
|
Looks like a dupe of #1475 |
Yeah, i think you may be right. I swear tried this yesterday. Will open another issue if I encounter another problem. Apologies for the oversight on my part and thank you very much for the eyes and brains @vrutkovs and @melledouwsma. |
Describe the bug
while updating from 4.11.0-0.okd-2023-01-14-152430 to 4.12.0-0.okd-2023-02-18-033438 control plane degrades due to first member being upgraded failing to return.
It seems that two nodes were updated, one of them being a control plane member caused the upgrade to stop progressing.
During the update (after all the operators were updated) the vsphere-problem-detector threw the following log:
After seeing that I had our vmware infrastructure team add those privileges to the role our cluster service account uses in vsphere and that issue cleared up. However, this was all while the two nodes (one infra/worker and one master) were in the state they're in now and in the must-gather.
Both systems are reachable should diagnostics outside of the must-gather be helpful:
Version
4.11.0-0.okd-2023-01-14-152430 to 4.12.0-0.okd-2023-02-18-033438
vSphere IPI
How reproducible
1 for 1 right now. We'll try our other clusters if we can keep this one from tipping over.
Log bundle
must-gather.local.8365916698519417107.zip
The text was updated successfully, but these errors were encountered: