-
-
Notifications
You must be signed in to change notification settings - Fork 564
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pings disappear from some peers to others after one week #3245
Comments
Hello @netandreus Could you reproduce the issue with verbose logging enabled and send me the logs? |
@netandreus Could you send me the public keys of the node-2 and node-3? In the connection mechanism has some logic that depends from the public keys between the peers. Maybe if we know which lane of the algorithm is running on your elk side and on the node side then we can go nearer to the root cause of the issue. |
@pappz sure, how can I fetch the public keys of my nodes? |
The
|
@pappz here they are: node-2:
node-3:
|
Great, thank you! Do you know which restart solved the issue:
|
@pappz It does not matter, what I do at the ELK side, only restarting the Netbird agent ( |
Good morning, @pappz ! Is there something from my side, that could help you? |
Hello @netandreus, How easy is it to reproduce the issue? Could you enable verbose logging on your agent and collect the logs? |
Thank you for your efforts, @pappz ! |
With these commands, you can set the logging level: |
@netandreus
|
I am working on another logic that can better manage the possible anomalies. Is it an option for you to do tests with a custom build with the patches? |
@pappz yes, I can. I should deploy custom build on both nodes? And how can I rollback if somethings go wrong? How can I collect logs? |
I prepared the test version. Here is the package for Linux. If you are using a different OS, I will send you different artifacts. Default installation path is netbird down
cp -a /usr/bin/netbird /usr/bin/netbird.bkp
cat /path/to/downloaded/netbird > /usr/bin/netbird
netbird up Don't forget to set the proper debug level! Logs are in /var/log/netbird. Clean them before testing for easier handling. If you are testing with previous machines (node-2, node-3), no need to update ELK peer. And do not forget, this is just a test version, be careful to use it in production env. I hope this fix will solve your issue but meantime I will dig deeper into this topic. |
Good morning, @pappz ! ![]() |
Strange. Here is the updated link. |
@netandreus could you send to me a debug bundle? You can generate it with this command: I would like to get a better picture of your network-related settings. This package contains all the necessary information. |
@pappz Sure, but I can`t downloaad it. May be some persmssions issue from your side? ![]() When I click to the file - I can only copy name. Maybe you need my google account or something from my side? |
The file that I uploaded is a ZIP archive. I think you opened it by the browser. If you download the full zip and manage it on your machine it would be easier. |
Done.
Done. You can find all files (both logs when error occures on stable version and debug bundle for test version) here: https://drive.google.com/drive/folders/1sRO8GprHSPS5LYgUqWg5iP2wCTHm7PHa?usp=drive_link
I'm deployed test version on the both nodes. Then I restarted netbird on elk node: ![]() And see this one for node-2 at elk ![]() and for node-3 at elk: ![]() |
@pappz and now I can't ping neither node-3 nor node-2 from elk. |
@netandreus |
@pappz sure, we can schedule a call for tomorrow 2025-02-04 from 10:00 GMT+4. I can give you access to Anydesk / ssh to these nodes. Please find me on Telegram - https://t.me/netandreus |
Thank you. I think you need to accept my messages. https://t.me/pzolinb |
Describe the problem
I have 2 peers:
supporting HA-routes to multiple vlans.
Also I have:
My problem is that after a week these nodes (node-2 and node-3) losing connection to ELK node, and ELK node losing connection to node-2 and node-3, I can't ping them by netbird IP addresses. But in the same time I can ping other peers from ELK peer and ping other than ELK peers from node-2 and node-3.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Stable connection between peers.
Are you using NetBird Cloud?
No, I use self-hosted netbird.
NetBird version
0.36.3
NetBird status -dA output:
When I go by ssh to node-3 I see this:
And same from node-2:
Workaround
After I run:
WireGuard connection re-established.
Related issues
Can you please fix this?
The text was updated successfully, but these errors were encountered: