
Memory Leak in Kubernetes Client #5626

Closed
cmdjulian opened this issue Dec 1, 2023 · 3 comments · Fixed by #5627

cmdjulian commented Dec 1, 2023

Describe the bug

Since version 6.9.0, a memory leak exists when using Jobs in combination with LogWatches.
I noticed it because memory gradually increases over time: the retained heap size in the old gen keeps growing. After a day of debugging, I believe the leak comes from the informer functionality and that the CompletableFuture used there is not properly torn down. This creates a lot of back references which the GC can't drop. However, I was not able to nail it down to the exact location.

I created a demo project that makes the fault appear quite frequently after a few minutes.
Please also see the attached heap dump showing the problem.

[heap dump screenshots showing the growing retained heap]

heapdump.zip

project.zip
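
Not the actual demo code (that is in project.zip), but roughly the Job-plus-LogWatch pattern it exercises; the job name, namespace, image, and timeout below are illustrative assumptions, and the demo runs this kind of iteration repeatedly from a task executor:

```java
import io.fabric8.kubernetes.api.model.batch.v1.Job;
import io.fabric8.kubernetes.api.model.batch.v1.JobBuilder;
import io.fabric8.kubernetes.client.KubernetesClient;
import io.fabric8.kubernetes.client.KubernetesClientBuilder;
import io.fabric8.kubernetes.client.dsl.LogWatch;

import java.util.concurrent.TimeUnit;

public class JobLogWatchDemo {
    public static void main(String[] args) throws Exception {
        try (KubernetesClient client = new KubernetesClientBuilder().build()) {
            // A short-lived Job whose logs we want to stream
            Job job = new JobBuilder()
                    .withNewMetadata().withName("demo-job").withNamespace("default").endMetadata()
                    .withNewSpec()
                        .withNewTemplate()
                            .withNewSpec()
                                .addNewContainer()
                                    .withName("main")
                                    .withImage("busybox")
                                    .withCommand("sh", "-c", "echo hello && sleep 5")
                                .endContainer()
                                .withRestartPolicy("Never")
                            .endSpec()
                        .endTemplate()
                    .endSpec()
                    .build();

            client.batch().v1().jobs().inNamespace("default").resource(job).create();

            // Stream the job's logs; this relies on the client's watch/informer machinery
            try (LogWatch ignored = client.batch().v1().jobs()
                    .inNamespace("default").withName("demo-job")
                    .watchLog(System.out)) {
                // Wait for the job to finish, then the LogWatch is closed
                client.batch().v1().jobs().inNamespace("default").withName("demo-job")
                        .waitUntilCondition(j -> j != null && j.getStatus() != null
                                        && j.getStatus().getSucceeded() != null
                                        && j.getStatus().getSucceeded() > 0,
                                2, TimeUnit.MINUTES);
            }
        }
    }
}
```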

My workaround at the moment is to just stay on 6.8.1, which does not seem to have the memory leak.

Fabric8 Kubernetes Client version

6.9.2

Steps to reproduce

  1. Run the demo project
  2. Wait 5 minutes
  3. Look at the heap dump

Expected behavior

no memory leak :D

Runtime

other (please specify in additional context)

Kubernetes API Server version

1.25.3@latest

Environment

Linux

Fabric8 Kubernetes Client Logs

No response

Additional context

k3d 1.25.3

shawkins (Contributor) commented Dec 1, 2023

The first thing I did was convert your example to plain Java 17, removing the task executor override and using just a cached thread pool for running the jobs in main. At least for me, after 10 minutes the memory usage exhibited a normal GC pattern and the heap was not growing. So I suspect the issue is with using virtual threads for the task executor. Can you confirm this?
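
A sketch of the kind of change meant here, not the exact diff applied to the demo project (it assumes the original used a virtual-thread-per-task executor):

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class ExecutorChoice {
    public static void main(String[] args) {
        // As reported: a virtual-thread-per-task executor, also used to override the
        // client's task executor (Java 21+ only):
        // ExecutorService executor = Executors.newVirtualThreadPerTaskExecutor();

        // Variant tested here: no task executor override on the client, and the jobs
        // in main submitted to a plain cached thread pool instead.
        ExecutorService executor = Executors.newCachedThreadPool();
        executor.submit(() -> System.out.println("run one Job + LogWatch iteration here"));
        executor.shutdown();
    }
}
```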

cmdjulian (Author) commented Dec 1, 2023

I disagree. I adjusted my code to use Java 17 without virtual threads, like you did:

v2.zip

dump.zip

After running for 10 minutes with a count of 16, I see the client now occupies most of the heap, and it keeps growing. The absolute size of a few MB is not that big, but the relative share, compared to all the other classes, becomes absurd. It's the same pattern as with the virtual threads. When switching to 6.8.1, I don't see anything like this:

[heap dump screenshots]

shawkins (Contributor) commented Dec 1, 2023

Ok, upping the thread count made the issue more apparent over a short interval. The problem is with the auto-closure logic: it adds a task to ensure the informer is closed if the client is closed, but nothing cleans that task up when the informer is closed naturally.
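
This is not the actual client internals, just the general shape of the leak described above: the client keeps a registration for each informer so it can close it when the client closes, and nothing removes that registration when the informer finishes on its own.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Illustrative sketch of the leak pattern, not fabric8's actual classes.
class ClientSketch {
    // Dependents registered so they get shut down when the client closes.
    private final Set<AutoCloseable> closeOnClientClose = ConcurrentHashMap.newKeySet();

    void registerForAutoClose(AutoCloseable dependent) {
        closeOnClientClose.add(dependent);
    }

    // The missing piece: without a corresponding removal when the dependent
    // (e.g. the informer backing a finished Job's LogWatch) closes naturally,
    // the client keeps a strong reference to it forever, so each finished Job
    // leaves its informer state unreclaimable by the GC.
    void unregister(AutoCloseable dependent) {
        closeOnClientClose.remove(dependent);
    }

    void close() {
        closeOnClientClose.forEach(c -> {
            try {
                c.close();
            } catch (Exception ignored) {
                // best effort on shutdown
            }
        });
    }
}
```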

@shawkins shawkins self-assigned this Dec 1, 2023
@manusa manusa added the bug label Dec 4, 2023
@manusa manusa added this to the 6.10.0 milestone Dec 4, 2023