Replies: 4 comments
-
I'm not sure that it was ever resolved; but it's not something that K3s or containerd would do. Best I can tell something in the GPU operator is terminating the pods, but I have no idea why it would do that. |
Beta Was this translation helpful? Give feedback.
-
Hey, so I don't think it is, as its all pods on the node, even the ones without the runtime option. |
Beta Was this translation helpful? Give feedback.
-
@brandond any other suggestions? |
Beta Was this translation helpful? Give feedback.
-
Converting to a discussion, since this doesn't seem to be a k3s bug |
Beta Was this translation helpful? Give feedback.
-
Environmental Info:
K3s Version:
Node(s) CPU architecture, OS, and Version:
Cluster Configuration:
1 Server, 1 GPU Agent, configured using https://github.com/alexellis/k3sup that are on different clouds
Describe the bug:
Essentially, I cannot run anything on the agent node because after 1-2mins, the pods restart.
Steps To Reproduce:
I doubt this is re-produceable, and instead think its caused by something being configured differently where I am hosting the GPU agent node (On Openstack through https://isaic.ca/), but litrally just running
k3sup join
on my system creates this problem.Expected behavior:
Default Pods (Node Exporter and Traefik) should come up and run fine
Actual behavior:
Pods constantly restart. The Traefik Pods have an error, the node exporter does not.
Additional context / logs:
Some things in here appear strange to me.
flannel.alpha.coreos.com/public-ip: 192.168.1.56
is set to the public IP, and from Lens https://k8slens.dev/ the memory is marked as N/A https://photos.app.goo.gl/xudHwja6YXhLZZ5r7I have verified that the container runtime is okay
Maybe one interesting thing to note, is other times I have configured k3s, and on the master node in this case, the internal node IP is always set to the external node IP, see
The contabo nodes internal IP is actually its external IP. I wonder if that has anything to do with this?
I also got a cgroup systemd error originally when setting this up, so every time I reboot I also run
Also, legacy iptables were not originally installed, so I fixed that with
Again, not sure if that info is relevant but thought it would be useful.
Beta Was this translation helpful? Give feedback.
All reactions