-
Notifications
You must be signed in to change notification settings - Fork 297
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[4.7 vSphere IPI] systemd-resolved configuration fails on installation #690
[4.7 vSphere IPI] systemd-resolved configuration fails on installation #690
Comments
Does this happen before -firstboot service run? Seems installer / machine-config-operator scripts should be creating the dir just in case too |
My feeling is that since this is supposed to be provided by FCOS and has been in the past it should be there... adding the directory "fixes" the symptom but not the issue EDIT: This turns out to not be an FCOS issue, but an OKD config issue as mentioned below. OKD was creating the directory structure described above |
I need to look at the previous release to see when that directory gets created |
So, fortunately this is not an issue with FCOS... Workaround until fix above is implemented: ssh into each affected node:
|
https://amd64.origin.releases.ci.openshift.org/releasestream/4.7.0-0.okd/release/4.7.0-0.okd-2021-06-14-203151 should have installer/machine-config-operator fixes for that (but since the squashed in existing commits no diff is displayed). vrutkovs/machine-config-operator@53bfabc |
I was able to succesfully install the cluster:
|
hey @fortinj66 is there any chance you can help me check this error dial tcp lookup image-registry.openshift-image-registry.svc on 10.10.8.132-53 no such host |
same problem here, @bobby0724
the
within the dns-default pod of openshift-dns namepsace the "." is missing
this leads to a "parse of /etc/resolv.conf failed" within the dns-node-resolver container of the dns-default pod. my quickfix was to add a "Domain=foo" in /etc/systemd/resolved.conf via machine-config So crio is working as expected with the correct
validated with:
of course this is a workaround only. I assume, that the root cause should be found in the chain |
Thanks for the explanation, I have update my DHCP settings to deliver hostname and domain, I have reinstall the cluster using UPI with DHCP and now the issue is gone |
Fixed in https://amd64.origin.releases.ci.openshift.org/releasestream/4-stable/release/4.7.0-0.okd-2021-06-19-191547, please reopen if this still happens |
Describe the bug
with the newest stable release, systemd-resolved is misconfigured due to a missing directory:
/etc/systemd/resolved.conf.d
should be created by FCOS and it is not.Since DNS resolution is now broken, cluster does not complete installation.
This seems to be a FCOS 34 issue as it does not happen with FCOS 33.
Note that latest stable uses FCOS 34 as the initial bootstrap image. Prior stable use FCOS 33.
Version
OpenShift Installer 4.7.0-0.okd-2021-06-13-090745
How reproducible
100%
The text was updated successfully, but these errors were encountered: