-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Calico node crash when create failsafe port #2904
Comments
is similar to #2901 |
Does kubespray run calico as non-root? Maybe we need a new permission. As a workaround you should be able to disable the XDP feature. |
@fasaxc it run as root |
Anything obviously different about the bad node? Is it configured as the master? Is it running any particular services? Some things that might help to find out what's special about that node:
and
I'[m wondering if that node had trouble mounting the BPF file system or if the BPF sysctls are disabling BPF calls. |
It seem all right... The bad node is not configured as the master |
Are you running any policing/enforcing apps on there? For example selinux or something that monitors syscalls? |
For anyone seeing this, a workaround should be to disable the XDP feature by setting the |
So, after reading more about this:
But I think this issue is still actionable... because there is still the fact that:
... After looking more at the code, (1) there is no backoff between the retries for the XDP shutdown thingy and (2) it appears that the |
@fasaxc @jayunit100 can we close this now that projectcalico/felix#2165 has gone in? |
For anyone else that comes across this, I found my issue was a combination of using ubuntu, kernel 5.3 and having secure boot enabled. Some newer kernels are enabling lockdown mode, which breaks BPF. You can read more at this comment and this bug report: Disabling bpf() syscall on kernel lockdown break apps when secure boot is on |
thanks for comment @mcmcghee, I'm hitting exactly this |
That's it ! @mcmcghee Thank you ! |
Current Behavior
Context
I setup kubernetes 1.15.3 on a fresh intalled cluster by Kubespray
All nodes run Ubuntu 18.04.3 and kernel 5.0.0-29
calico version is 3.7.3 , I also try 3.9.1 and get the same error
I search the code and find the error command is
bpftool map create /sys/fs/bpf/calico/calico_failsafe_ports_v1 type hash key 4 value 1 entries 65535 name calico_failsafe_ports_v1 flags 1
It failed on only one node and after I reinstall docker it also failed
When it will be
Operation not permitted
?Your Environment
The text was updated successfully, but these errors were encountered: