Need metric to return job failed reason #947
Comments
If there is a bounded number of reasons, I think this would be OK to have.
@AdityaMisra do you mind looking up the number of reasons for failed jobs first, so we can make sure the metric does not have high, unbounded cardinality? Otherwise go ahead. :)
/assign @AdityaMisra
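To illustrate the cardinality concern, here is a minimal sketch (not the kube-state-metrics implementation) of emitting one sample per reason from a fixed list, so the number of series per job stays constant no matter what reason string the cluster reports. The `KNOWN_FAILED_REASONS` list and the `failed_reason_samples` helper are hypothetical; the actual set of reasons would need to be confirmed first, as asked above.

```python
# Hypothetical fixed list of failure reasons. "Evicted" comes from this issue;
# the other two are assumed here and must be verified before relying on them.
KNOWN_FAILED_REASONS = ("BackoffLimitExceeded", "DeadlineExceeded", "Evicted")


def failed_reason_samples(job_name, namespace, failed_reason):
    """Return one (labels, value) pair per known reason.

    The value is 1.0 for the reason that matches and 0.0 for the others.
    An unknown reason matches nothing, so it can never create a new series.
    """
    samples = []
    for reason in KNOWN_FAILED_REASONS:
        labels = {"namespace": namespace, "job_name": job_name, "reason": reason}
        value = 1.0 if reason == failed_reason else 0.0
        samples.append((labels, value))
    return samples
```

With this shape, every job contributes exactly `len(KNOWN_FAILED_REASONS)` series, which is the bound the maintainers are asking about.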
This is also needed for normal pods, not only jobs.
@juliantaylor Feel free to open a separate issue for pods. As it's a different resource, it makes sense to discuss that separately.
Issues go stale after 90d of inactivity. If this issue is safe to close now, please do so. Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/remove-lifecycle stale
@AdityaMisra Are you still working on this?
@RajatVaryani I'm working on it. I will open a PR soon.
Issues go stale after 90d of inactivity. If this issue is safe to close now, please do so. Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/remove-lifecycle stale
There is a PR for this.
Is this a BUG REPORT or FEATURE REQUEST?:
/kind feature
What happened:
Missing a metric like kube_job_status_failed_reason.
What you expected to happen:
Add kube_job_status_failed_reason to display the failed reason, such as Evicted.
How to reproduce it (as minimally and precisely as possible):
Create a CronJob, but limit its resources.
Anything else we need to know?:
K8S returns the following in the Evicted job
We would like to filter out the Evicted jobs, even though they are marked as failed.
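As a rough sketch of the filtering described here, the snippet below takes Job objects as plain dictionaries (the shape of the items in `kubectl get jobs -o json`) and lists failed jobs while skipping a set of ignored reasons. It assumes the reason is reported on the Job's `Failed` condition, which this issue does not confirm; the helper names `failed_reason` and `failures_excluding` are made up for illustration.

```python
def failed_reason(job):
    """Return the reason of the Job's true "Failed" condition, or None."""
    conditions = job.get("status", {}).get("conditions") or []
    for cond in conditions:
        if cond.get("type") == "Failed" and cond.get("status") == "True":
            return cond.get("reason", "")
    return None


def failures_excluding(jobs, ignored_reasons=("Evicted",)):
    """Return names of failed jobs whose reason is not in ignored_reasons."""
    names = []
    for job in jobs:
        reason = failed_reason(job)
        if reason is not None and reason not in ignored_reasons:
            names.append(job["metadata"]["name"])
    return names
```

A metric with a `reason` label would let the same filter be expressed as a label matcher in a query instead of client-side code.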
Environment:
Kubernetes version (use kubectl version):