-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Define critical metrics of the agent and expose them. #293
Comments
Hello, we have started to look into OCM and came up with a few metrics that we think could potentially be valuable in addition to the existing ones that are in place for the Current Metrics:
Proposed Additional Metrics:
|
cc @bjoydeep this looks very interesting |
Great points @DerekHeldtWerle . Yes, it absolutely makes sense to add more metrics. BTW, in RedHat's productized version of ACM, we do add a few metrics: https://github.com/stolostron/metrics-chronicle/blob/main/docs/acm/component/server-foundation/metrics.md. You will see some overlaps with what you suggested above. I am personally a little ambivalent to 5. However, there is different requirement IMHO which can be great practical help to engineers maintaining the system. If we can collect key life cycle changes in a cluster and publish them as events that can be consumed by non-Kube systems, that may be very helpful. If for example a node has been added or removed from a cluster - |
This issue is stale because it has been open for 120 days with no activity. After 14 days of inactivity, it will be closed. Remove the |
No description provided.
The text was updated successfully, but these errors were encountered: