Intermittent failures with "Signalted to exit!" message #591

Minoru · 2020-03-05T18:35:15Z

Question

(I'm not convinced this is a bug in Cirrus, so filing a question).

For a few weeks now, my jobs have been failing sometimes with "Signaled to exit!" at the end of their log, like this: https://cirrus-ci.com/task/6101281580253184 Most of the time this happens on FreeBSD, but sometimes with Docker on Linux as well: https://cirrus-ci.com/task/4998716163620864

I thought it might be caused by the OOM killer. Local builds in Docker succeed with 2GB, so I used that as the limit on Cirrus. Still sporadic failures. I then tried 3GB, and now 4GB, and I'm still seeing failures.

A similar issue has been reported earlier (#137); it looks like that was a bug in Cirrus and it was resolved. So I guess I'm running into something different.

Is there anything I can do to debug this?

Minoru · 2020-03-05T18:38:23Z

Not sure if that's related, but here's a job that was killed twice now with OOM: https://cirrus-ci.com/task/6598968113102848 It's just a few code formatters, it never exceeded its allotted 256MB before. (Worked fine with manual re-run.)

fkorotkov · 2020-03-05T19:56:55Z

I have only seen similar behaviour when the execution environment is killed due to OOM. Cirrus only detects OOMs for containers at the moment. The Cirrus agent runs inside the container or VM, so it is killed along with it in case of an OOM, and it logs "Signalted to exit!" when it is killed from the outside. For containers, Cirrus checks the container status with Kubernetes before cleaning it up and adds the OOM warning.

#93 should help to get better visibility into your builds, but in my experience this is usually related to tools that don't respect the cgroup limits of containers. Quite a few tools are still not aware of being executed in a container and by default pick up the resources of the host rather than of the container. Cirrus uses 32 CPU / 96GB host VMs for running containers. Maybe your tool thinks that it has that many resources instead of 1 CPU / 0.25 GBs?
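To illustrate the host-versus-container mismatch described above, here is a minimal Python sketch that compares what the host reports with what the cgroup allows. It assumes a cgroup v2 layout, where the limits are exposed in /sys/fs/cgroup/cpu.max and /sys/fs/cgroup/memory.max; cgroup v1 uses different files, and the paths inside a given Cirrus container may differ. The function names are hypothetical and are not part of Cirrus or any particular build tool.

```python
import math
import os
from pathlib import Path
from typing import Optional


def parse_cpu_max(text: str) -> Optional[float]:
    """Parse cgroup v2 cpu.max content: "<quota> <period>" or "max <period>"."""
    parts = text.split()
    if len(parts) != 2 or parts[0] == "max":
        return None  # no CPU limit set, or unexpected format
    return int(parts[0]) / int(parts[1])


def parse_memory_max(text: str) -> Optional[int]:
    """Parse cgroup v2 memory.max content: a byte count or "max"."""
    value = text.strip()
    if not value or value == "max":
        return None  # no memory limit set
    return int(value)


def read_text(path: str) -> Optional[str]:
    """Return the file content, or None if the file cannot be read."""
    try:
        return Path(path).read_text()
    except OSError:
        return None


def effective_cpus(cpu_max_path: str = "/sys/fs/cgroup/cpu.max") -> int:
    """CPU count a tool should plan for: the cgroup limit if present, else the host count."""
    host = os.cpu_count() or 1
    text = read_text(cpu_max_path)
    limit = parse_cpu_max(text) if text is not None else None
    if limit is None:
        return host
    # Round fractional quotas up, but never below 1 or above the host count.
    return max(1, min(host, math.ceil(limit)))


if __name__ == "__main__":
    mem_text = read_text("/sys/fs/cgroup/memory.max")
    mem_limit = parse_memory_max(mem_text) if mem_text is not None else None
    print("host CPUs:", os.cpu_count())
    print("effective CPUs:", effective_cpus())
    print("memory limit (bytes):", mem_limit)
```

Running this inside the task and comparing "host CPUs" with "effective CPUs" shows whether a tool that sizes its worker pool from the host count would oversubscribe the container. Many build tools accept an explicit job count, which can be set from the effective value instead.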

Minoru · 2020-03-05T20:09:57Z

> Maybe your tool thinks that it has that many resources instead of 1 CPU / 0.25 GBs?

We did have this issue, but I fixed it a while ago.

I think I'll try to create a VM or something that'll emulate that many cores, and see if I can reproduce the issue there. Thanks!

Minoru added the question label Mar 5, 2020

Minoru closed this as completed Mar 5, 2020

jsiwek mentioned this issue Mar 5, 2020

FreeBSD tasks signaled to exit, not restarted, likely not OOM ? #592

Closed
