You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We wrote a <100 line POC python script to monitor memory
pressure and kill stuff way before such pathological thrashing leads
to full system losses that require forcible hard resets.
At some point I should look more into this and replace the current algorithm (based in swapin*swapout) with the new /proc/pressure/memory-statistics. However, /proc/pressure is missing in most of the production servers I'm having responsibility for, so it will take a long time before the backward-compatible algorithm can be obsoleted.
IMHO PSI is maybe best metrics to detect thrashing.
https://lwn.net/Articles/759658/
https://facebookmicrosites.github.io/psi/
You can try to use it to detect thrashing instead of vmstat.
PSI file example (/proc/pressure/memory):
Use
total
metrics.The text was updated successfully, but these errors were encountered: