This repository has been archived by the owner on Jan 30, 2020. It is now read-only.
I have an example where the systemd unit file is gone but the unit is still running. This happened because the kill command in the unit file did not succeed. Since this code does not check for an error, it goes ahead and removes all the systemd files. With those files gone, I cannot stop the unit through fleet, because fleet cannot issue the systemd stop for a unit whose file it has already deleted.
In Agent.unloadUnit(), if systemdUnitManager.TriggerStop() returns
any error, do not unload systemd units. Otherwise the unit could get
into a state where the unit cannot be stopped via fleet, because the
unit file was already removed.
Fixes coreos#1216
I have noticed a race condition in production where a job is supposed to be unloaded but never stops.
I think I found the bug here: fleet/agent/agent.go, line 98 in 7a64877.