Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[chassis][supervisor] [master]database-chassis.service failed to start at reboot on Supervisor #20715

Closed
mlok-nokia opened this issue Nov 6, 2024 · 6 comments
Assignees
Labels
regression Triaged this issue has been triaged

Comments

@mlok-nokia
Copy link
Contributor

Description

On master branch, database-chassis.service start failed at boot up on Supervisor. The following syslog shows the failure info

2024-11-01T15:17:25.148322+00:00 sonic systemd[1]: database-chassis.service: Control process exited, code=exited, status=1/FAILURE
2024-11-01T15:17:25.148322+00:00 sonic systemd[1]: database-chassis.service: Control process exited, code=exited, status=1/FAILURE
2024-11-01T15:17:25.148518+00:00 sonic systemd[1]: database-chassis.service: Failed with result 'exit-code'.
2024-11-01T15:17:25.148518+00:00 sonic systemd[1]: database-chassis.service: Failed with result 'exit-code'.
2024-11-01T15:17:25.149312+00:00 sonic systemd[1]: Failed to start database-chassis.service - database-chassis container.
2024-11-01T15:17:25.149312+00:00 sonic systemd[1]: Failed to start database-chassis.service - database-chassis container.
2024-11-01T15:17:25.149678+00:00 sonic systemd[1]: Dependency failed for config-topology.service - Platform topology configuration service.
2024-11-01T15:17:25.149734+00:00 sonic systemd[1]: Dependency failed for config-setup.service - Config initialization and migration service.
2024-11-01T15:17:25.149678+00:00 sonic systemd[1]: Dependency failed for config-topology.service - Platform topology configuration service.
2024-11-01T15:17:25.149734+00:00 sonic systemd[1]: Dependency failed for config-setup.service - Config initialization and migration service.
2024-11-01T15:17:25.149782+00:00 sonic systemd[1]: Dependency failed for swss@14.service - switch state service.
2024-11-01T15:17:25.149782+00:00 sonic systemd[1]: Dependency failed for swss@14.service - switch state service.

Steps to reproduce the issue:

  1. Just reboot supervisor with the latest master image, and check the syslog. The following syslog
2024-11-01T15:17:25.148322+00:00 sonic systemd[1]: database-chassis.service: Control process exited, code=exited, status=1/FAILURE
2024-11-01T15:17:25.148322+00:00 sonic systemd[1]: database-chassis.service: Control process exited, code=exited, status=1/FAILURE
2024-11-01T15:17:25.148518+00:00 sonic systemd[1]: database-chassis.service: Failed with result 'exit-code'.
2024-11-01T15:17:25.148518+00:00 sonic systemd[1]: database-chassis.service: Failed with result 'exit-code'.
2024-11-01T15:17:25.149312+00:00 sonic systemd[1]: Failed to start database-chassis.service - database-chassis container.
2024-11-01T15:17:25.149312+00:00 sonic systemd[1]: Failed to start database-chassis.service - database-chassis container.
2024-11-01T15:17:25.149678+00:00 sonic systemd[1]: Dependency failed for config-topology.service - Platform topology configuration service.
2024-11-01T15:17:25.149734+00:00 sonic systemd[1]: Dependency failed for config-setup.service - Config initialization and migration service.
2024-11-01T15:17:25.149678+00:00 sonic systemd[1]: Dependency failed for config-topology.service - Platform topology configuration service.
2024-11-01T15:17:25.149734+00:00 sonic systemd[1]: Dependency failed for config-setup.service - Config initialization and migration service.
2024-11-01T15:17:25.149782+00:00 sonic systemd[1]: Dependency failed for swss@14.service - switch state service.
2024-11-01T15:17:25.149782+00:00 sonic systemd[1]: Dependency failed for swss@14.service - switch state service.

Describe the results you received:

database-chasis.service failed to start at reboot

Describe the results you expected:

It should not be failed.

Output of show version:

(paste your output here)

Output of show techsupport:

(paste your output here or download and attach the file here )

Additional information you deem important (e.g. issue happens only occasionally):

@arlakshm
Copy link
Contributor

arlakshm commented Nov 6, 2024

@mlok-nokia to add more error logs.

@arlakshm arlakshm added the Triaged this issue has been triaged label Nov 6, 2024
@anamehra
Copy link
Contributor

anamehra commented Nov 7, 2024

Hi @rlhui , as discussed in community meeting, Cisco SIM sanities are also failing with latest master code. While going though the recent commits, I came across #19016
Reverting this in our run passed SIM sanity. I will validate this on h/w as well.

Hi @mlok-nokia , could you please try with this PR reverted?

@anamehra
Copy link
Contributor

anamehra commented Nov 7, 2024

We also validated the build successfully on Cisco h/w with #19016 reverted.

@arlakshm
Copy link
Contributor

#20726 might have the potential fix. Please test with this change

@mlok-nokia
Copy link
Contributor Author

mlok-nokia commented Nov 15, 2024

#20726 might have the potential fix. Please test with this change

@arlakshm This PR fixes the /etc/supervisor/critical_processes file in "database" container. It does not fix the database-chassis.service issue.

@mlok-nokia
Copy link
Contributor Author

Fixed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
regression Triaged this issue has been triaged
Projects
Status: Done
4 participants