You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What is your proposal:
Refactor CPI collection logic, splitting the collection logic for containers that are unrelated.
Why is this needed:
In mixed deployment scenarios, the combination of multi-core CPUs and a large number of pods can lead to concentrated syscall invocations during CPI collection, causing spikes in CPU usage.
Is there a suggested solution, if so, please add it:
By introducing jitter, the CPI collection logic has been diversified, resulting in a more evenly distributed CPU usage pattern.
The text was updated successfully, but these errors were encountered:
@Rouzip By introducing jitter, do you refer to add a small delay to the timing of CPI collection events across different container? For this, maybe one simple strategy to divide containers is by QoS or Priority.
P.S. one of my concern is that jitter might affect the precision of CPI data by reflecting container performance at varied times, leading to potential accuracy loss. Anyway, by intuition, this should be acceptable.
Rouzip
changed the title
[proposal] Optimize CPI collect
[issue] Optimize CPI collect
Feb 5, 2024
Rouzip
changed the title
[issue] Optimize CPI collect
[issue] Optimize CPI collect in perfGroup
Feb 5, 2024
@Rouzip By introducing jitter, do you refer to add a small delay to the timing of CPI collection events across different container? For this, maybe one simple strategy to divide containers is by QoS or Priority.
P.S. one of my concern is that jitter might affect the precision of CPI data by reflecting container performance at varied times, leading to potential accuracy loss. Anyway, by intuition, this should be acceptable.
Thanks for your reply. After double check the code, I found that the high spikes of CPU usage reason is in the wrong implement in perfGroup interface. I will fix this issue soon.
What is your proposal:
Refactor CPI collection logic, splitting the collection logic for containers that are unrelated.
Why is this needed:
In mixed deployment scenarios, the combination of multi-core CPUs and a large number of pods can lead to concentrated syscall invocations during CPI collection, causing spikes in CPU usage.
Is there a suggested solution, if so, please add it:
By introducing jitter, the CPI collection logic has been diversified, resulting in a more evenly distributed CPU usage pattern.
The text was updated successfully, but these errors were encountered: