-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove the monit_syncd, from docker-syncd-brcm-dnx. #8003
Conversation
Thanks @yxieca for identifying this from nightly test logs, verified config load_minigraph works ok on a sonic device without this file monit_syncd. |
it file should be only in dnx image, why is this affecting xgs image at all? |
I did some checks on why this happened. Both the dnx/xgs images and their respective docker-syncd & docker-syncd-dnx are build together. The docker package name for xgs/dnx is different, but the container/process names is same for both.
So keeping the same monit_syncd file in both docker-syncd-brcm and docker-syncd-brcm-dnx was ok, as the docker/process names are same. But when we copy these files while building the image - in files/build_templates/sonic_debian_extension.j2, we copy all the monit files present to /etc/monit/conf.d/ as below
Since we include both the docker-syncd-brcm.mk & docker-syncd-brcm-dnx.mk in the platform/broadcom/rules.mk, the monit_syncd got copied to both the XGS and DNX builds and we didn't differentiate monit_syncd in each of the dockers as the processes they monitor were with same name. |
this seems problematic, can we fix it? |
Since we do build both the syncd/syncd-dnx & broadcom/broadcim-dnx images in the same make command flow -- the approach taken was
This approach helped to differentiate though we include docker-syncd-brcm.mk & docker-syncd-brcm-dnx.mk in the platform/broadcom/rules.mk. Let me know your thoughts - thanks. |
Remove the references to file monit_syncd from docker-syncd-brcm-dnx, which got missed as the PR sonic-net#7598 overlapped sonic-net#7676
Why I did it
Remove the file platform/broadcom/docker-syncd-brcm-dnx/base_image_files/monit_syncd, from new docker "docker-syncd-brcm-dnx" which was added recently via #7598
There was a activity for monit in PR #7676, which was not updated in the PR #7598 ( both PRs were worked upon in parallel )
How I did it
How to verify it
This was resulting in config load_minigraph to fail. Verified after removing the file /etc/monit/conf.d/monit_syncd
Which release branch to backport (provide reason below if selected)
Description for the changelog
A picture of a cute animal (not mandatory but encouraged)