-
Notifications
You must be signed in to change notification settings - Fork 249
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Produce metrics
for consumers
#1162
Comments
Thanks @jcpunk for the suggestion. I think this would definitely be a worthy enhancement. We would just need to understand what kind of metrics to advertise 🧐 On the nfd-master side we could have some counters and/or gauges identifying e.g. the number of nodes updated, CRDs processed etc. We could also expose the time spent on updating nodes on the nfd-master side and the time spent in doing the feature discovery on the nfd-worker side. ping @ArangoGutierrez @PiotrProkop @zvonkok @Tal-or any thoughts? |
On worker side, maybe advertise if the worker can't reach master, even tho this is logged already. |
/assign |
/milestone v0.14 |
@ArangoGutierrez is working on an implementation. @jcpunk @PiotrProkop @zvonkok @fmuyassarov @Tal-or any wishes/thoughts on which kind of metrics to expose. I listed some above |
I'm not really sure what metrics would be useful... |
@marquiz on the topology-updater side we can expose the following:
Adding @swatisehgal and @ffromani as they might have more ideas on the subject |
What would you like to be added:
Some telemetry information about the daemons running. I'm not sure what metrics would make sense for this daemonset - perhaps the flag stats on the node? But having something I can monitor is handy.
Why is this needed:
I generally don't like to deploy things I can't monitor. Having native telemetry available may also help troubleshoot oddities in the wild.
The text was updated successfully, but these errors were encountered: