New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add container metrics fields from ECS #87

Closed

ChrsMark wants to merge 2 commits into open-telemetry:main from ChrsMark:add_container_metrics

Member

ChrsMark commented Jun 6, 2023 •

edited

Loading

This PR adds container related metrics fields as part of #72.

cc: @AlexanderWert @kaiyan-sheng @mlunadia

ChrsMark changed the title ~~Add container metrics fields from ECS~~ [WIP] Add container metrics fields from ECS

ChrsMark force-pushed the add_container_metrics branch from 8a10c6f to 3dd54a3 Compare

June 19, 2023 09:30

ChrsMark changed the title ~~[WIP] Add container metrics fields from ECS~~ Add container metrics fields from ECS

ChrsMark force-pushed the add_container_metrics branch 3 times, most recently from 995ab45 to 1076e36 Compare

June 19, 2023 09:33

ChrsMark marked this pull request as ready for review

June 19, 2023 09:34

ChrsMark requested review from a team

June 19, 2023 09:34

github-actions bot assigned jsuereth


          Add container metrics fields from ECS

d781478

Signed-off-by: ChrsMark <chrismarkou92@gmail.com>

ChrsMark force-pushed the add_container_metrics branch from 1076e36 to d781478 Compare

June 19, 2023 14:16


          Add specification tables

33d8866

Signed-off-by: ChrsMark <chrismarkou92@gmail.com>

ChrsMark mentioned this pull request

REQUEST: New membership for @ChrsMark open-telemetry/community#1599

Closed

6 tasks

jsuereth reviewed

View reviewed changes

specification/metrics/semantic_conventions/container.md


		## HTTP Server

		### Metric: `metric.container.cpu.usage`

Contributor

jsuereth Aug 2, 2023

The name should just be container.cpu.usage

The metric. classifier is for our YAML database, not the metric name. (Sorry for confusion)

Same comment for all other metrics.

trask reviewed

View reviewed changes

semantic_conventions/metrics/container.yaml

+              groups:
+                - id: metric.container.cpu.usage
+                  type: metric
+                  metric_name: container.cpu.usage

Member

trask Aug 2, 2023 •

edited

Loading

is this needed separate from system.cpu.*?

(there was some related discussion in open-telemetry/opentelemetry-specification#2388)

Member Author

ChrsMark Aug 3, 2023

Thank's @trask! That's interesting one and it's quite similar to open-telemetry/opentelemetry-specification#2388 (comment).

I will try to share my view on this :).
So, from infrastructure's observability point of view, container metrics would be collected from outside the containers themselves (as a best practice).
For example by using the Docker runtime's APIs or Kubelet's API's.

So the point here is that it does not make a lot of sense to treat a container as a host and collect its metrics by running a collector inside the container.

So for metrics collected through the runtime/orchastrator APIs I think we need to be specific and have a resource specific namespace like container.cpu, pod.cpu etc.

In many cases the CPU/Memory resources are limited as well:

In very specific usecases, if someone wants to treat a container as a host (kind uses containers' as hosts) then the collector should be running on the container/host directly and report metrics under the system.* namespace. In that case the resource is a "host" not a container from the observation/collection point of view.

Also related to #226 (comment).

lmolkova reviewed

View reviewed changes

semantic_conventions/metrics/container.yaml

+                    on all network interfaces
+                    by the container since
+                    the last metric collection.
+                  instrument: gauge

Contributor

lmolkova Aug 3, 2023

Should be a counter instead (since values are additive)?

https://github.com/open-telemetry/opentelemetry-specification/blob/065b25024549120800da7cda6ccd9717658ff0df/specification/metrics/supplementary-guidelines.md#instrument-selection

semantic_conventions/metrics/container.yaml

+                    on all network interfaces
+                    by the container since
+                    the last metric collection.
+                  instrument: gauge

Contributor

lmolkova Aug 3, 2023

should it be a counter?

semantic_conventions/metrics/container.yaml

+                    The total number of bytes written
+                    successfully (aggregated from all disks)
+                    since the last metric collection
+                  instrument: gauge

Contributor

lmolkova Aug 3, 2023

should it be a counter? (https://github.com/open-telemetry/opentelemetry-specification/blob/065b25024549120800da7cda6ccd9717658ff0df/specification/metrics/supplementary-guidelines.md#instrument-selection)

semantic_conventions/metrics/container.yaml

+                    The total number of bytes read
+                    successfully (aggregated from all disks)
+                    since the last metric collection.
+                  instrument: gauge

Contributor

lmolkova Aug 3, 2023

and here: it's a counter

ChrsMark mentioned this pull request

Add container metric fields (from ECS) #282

Merged

Member Author

ChrsMark commented Aug 24, 2023 •

edited

Loading

Closing in honor of #282. After the project restructuring it was hard to rebase and recover this branch due to some weird permission errors. Starting fresh was faster.

ChrsMark closed this

ChrsMark mentioned this pull request

Document the difference between a host and a system metric #226

Closed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet