kube_pod_container_state_started is not returned for some pods #1467

AnastasiaBlack · 2021-04-28T07:51:50Z

What happened:
We have multiple pods (jobs) that run for a short period of time (the time from container start to termination varies from 2-3 seconds to a couple of minutes for each pod). We use a query kube_pod_container_state_started{pod="needePodName"} in Prometheus and it appears that it works for some pods and doesn't show the result for others (while other queries, including kube_pod_start_time show the result for all of the pods).

What you expected to happen:
kube_pod_container_state_started shows result for every pod, and not just for some of them.

How to reproduce it (as minimally and precisely as possible):
Run multiple pods with short-time living containers, check that a query kube_pod_container_state_started to prometheus doesn't show the results for some of the pods.

Anything else we need to know?:
If we check the pods with kubectl describe pod [pod_name] - the values we need (container start time) are present. When we make a query with kube_pod_start_time for every pod - we get the expected result (that is the timestamp when the pod started). But now we want to get the timestamps when the Containers were started.

Environment:

kube-state-metrics version: v.2.0.0
Kubernetes version (use kubectl version): v.1.19.6

The text was updated successfully, but these errors were encountered:

harjas27 · 2021-05-03T15:56:18Z

This metric is captured from the field State.Running.StartedAt in the container status of pod.
In the span of 2-3 seconds, the state of the container changes from Waiting -> Running -> Terminated. Whenever the state is updated, the metrics for that pod are overwritten in the metrics map. Since the state changes to Terminated, the metric is not captured. This might be the reason for the metric not being collected for some of the pods.
As a fix, we can update

kube-state-metrics/internal/store/pod.go

Line 293 in 4009bea

Value: float64((cs.State.Running.StartedAt).Unix()),

to take the value from either cs.State.Running.StartedAt or cs.State.Terminated.StartedAt depending on the state of the container.
@lilic

AnastasiaBlack · 2021-05-05T08:13:18Z

That would be great!

AnastasiaBlack · 2021-06-24T09:12:56Z

Hello! Is there any news on this issue? Will it be implemented in the future?

AnastasiaBlack added the kind/bug Categorizes issue or PR as related to a bug. label Apr 28, 2021

AnastasiaBlack mentioned this issue Apr 28, 2021

kube-state-metrics keep serving stale metrics after extended apiserver outage #694

Closed

harjas27 mentioned this issue Jul 1, 2021

capture start time for containers in terminated state #1519

Merged

k8s-ci-robot closed this as completed in #1519 Aug 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kube_pod_container_state_started is not returned for some pods #1467

kube_pod_container_state_started is not returned for some pods #1467

AnastasiaBlack commented Apr 28, 2021 •

edited

Loading

harjas27 commented May 3, 2021

AnastasiaBlack commented May 5, 2021

AnastasiaBlack commented Jun 24, 2021

kube_pod_container_state_started is not returned for some pods #1467

kube_pod_container_state_started is not returned for some pods #1467

Comments

AnastasiaBlack commented Apr 28, 2021 • edited Loading

harjas27 commented May 3, 2021

AnastasiaBlack commented May 5, 2021

AnastasiaBlack commented Jun 24, 2021

AnastasiaBlack commented Apr 28, 2021 •

edited

Loading