cgroup v1/v2 compatibility issue when setting memory below the current usage #3509

kolyshkin · 2022-06-14T22:32:16Z

With cgroup v1, when we set the memory limit to below the current usage (runc update on a running container), the kernel returns EBUSY and runc fails with a nice error message:

ERRO[0000] unable to set memory limit to 27033 (current usage: 270336, peak usage: 6082560)

With cgroup v2, when do do this, kernel OOM killer just kill the container. This makes this behavior incompatible with cgroup v1.

One (imperfect) workaround is to add a flag to OCI spec that disallows to set memory limit to the value lower than the current usage. This is borderline ugly but at least in most cases we'll return an error instead of letting the container being OOM killed.

(the other, much less serious part of the problem is, when container is disappearing in the middle of runc update, we get all sorts of ugly messages)

The text was updated successfully, but these errors were encountered:

giuseppe · 2022-06-15T07:17:05Z

could we use memory.high instead of memory.max?

danishprakash · 2022-06-15T14:05:20Z

I don't have a complete understanding at this point but are we talking about cgroup memory limit applied at the time of container creation? And if that's the case, is the difference then the fact that in cgroupv2 the kernel isn't returning an EBUSY anymore?

add a flag to OCI spec

And then have runc parse it and fail early instead of the container being OOMKilled?

mrunalp · 2022-06-15T21:08:35Z

This is when we try to update the memory limit of an already running container to a value that is less than what it is currently using. In v1, we got EBUSY, but in v2, kernel applies the value and if it is low, the container is OOM Killed.

kolyshkin · 2022-06-16T00:27:11Z

could we use memory.high instead of memory.max?

From the vertical pod autoscaler POV -- yes. Meaning, it will still have to distinguish between v1 and v2. Meaning, it does not make sense to add a flag I have proposed in the description.

mrunalp · 2022-06-17T18:33:04Z

could we use memory.high instead of memory.max

I think that will have to be phase 2 with cgroups v2 in k8s. Phase 1 is just a direct mapping to v1.

utam0k · 2022-08-25T01:02:20Z

Is it possible to get the current memory usage from memory.current and if it is lower than that, not update it and return an error? This may be too much help as OCI runtime...?

This setting can be used to mimic cgroup v1 behavior on cgroup v2, when setting the new memory limit during update operation. In cgroup v1, a limit which is lower than the current usage is rejected. In cgroup v2, such a low limit is causing an OOM kill. Ref: opencontainers/runc#3509 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>

kamizjw · 2022-09-06T03:16:15Z

Is there a similar problem with other configurations other than memory?

kolyshkin · 2022-09-09T23:18:40Z

Is there a similar problem with other configurations other than memory?

Not that I know of.

mrunalp mentioned this issue Jun 15, 2022

In-place Pod Vertical Scaling feature kubernetes/kubernetes#102884

Merged

AkihiroSuda added area/cgroupv2 area/cgroupv1 kind/bug labels Jun 15, 2022

This was referenced Aug 29, 2022

config-linux: add memory.checkBeforeUpdate opencontainers/runtime-spec#1158

Merged

runc update: implement memory.checkBeforeUpdate #3579

Merged

kolyshkin closed this as completed in #3579 Nov 3, 2022

cyphar mentioned this issue Jul 7, 2023

libct/nsenter: namespace the bindfd shuffle #3599

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cgroup v1/v2 compatibility issue when setting memory below the current usage #3509

cgroup v1/v2 compatibility issue when setting memory below the current usage #3509

kolyshkin commented Jun 14, 2022 •

edited

Loading

giuseppe commented Jun 15, 2022 •

edited

Loading

danishprakash commented Jun 15, 2022 •

edited

Loading

mrunalp commented Jun 15, 2022

kolyshkin commented Jun 16, 2022

mrunalp commented Jun 17, 2022

utam0k commented Aug 25, 2022

kamizjw commented Sep 6, 2022

kolyshkin commented Sep 9, 2022

cgroup v1/v2 compatibility issue when setting memory below the current usage #3509

cgroup v1/v2 compatibility issue when setting memory below the current usage #3509

Comments

kolyshkin commented Jun 14, 2022 • edited Loading

giuseppe commented Jun 15, 2022 • edited Loading

danishprakash commented Jun 15, 2022 • edited Loading

mrunalp commented Jun 15, 2022

kolyshkin commented Jun 16, 2022

mrunalp commented Jun 17, 2022

utam0k commented Aug 25, 2022

kamizjw commented Sep 6, 2022

kolyshkin commented Sep 9, 2022

kolyshkin commented Jun 14, 2022 •

edited

Loading

giuseppe commented Jun 15, 2022 •

edited

Loading

danishprakash commented Jun 15, 2022 •

edited

Loading