-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
openshift-sdn network restart terminates run once pods immediately #16632
Comments
It terminates pods if (and only if) it can't re-establish networking to them. The assumption was that kubernetes would restart the pod in that case but I guess that doesn't work in all cases. But it would only be unable to re-establish networking to them if something went wrong during the restart. This is basically a dup/extension of #16630. |
Yeah, we should probably not be restarting restart=Never pods, because there is nothing we can do anymore (its network is going to continue broken, it's up to the container to die). |
p1 because we terminate pods that might run safely to completion |
Issues go stale after 90d of inactivity. Mark the issue as fresh by commenting If this issue is safe to close now please do so with /lifecycle stale |
Stale issues rot after 30d of inactivity. Mark the issue as fresh by commenting If this issue is safe to close now please do so with /lifecycle rotten |
Rotten issues close after 30d of inactivity. Reopen the issue by commenting /close |
The CRI net namespace restart function when openshift-sdn restarts is terminating run once pods that may not need networking, leading to failures.
It's not clear to me that completely terminating all run-once pods on a node when the sdn process is disrupted is correct.
@openshift/sig-networking
The text was updated successfully, but these errors were encountered: