Add http metrics for content length #2567

jack-berg · 2022-05-20T19:10:03Z

It's very useful to have metrics about http request and response content length, for both http clients and servers.

I propose the addition of the following metrics:

| Name | Instrument Type | Unit | Description |
|---|---|---|---|
| `http.server.request_content_length` | Histogram | By | Measures the size of the HTTP request payload |
| `http.server.response_content_length` | Histogram | By | Measures the size of the HTTP response payload |
| `http.client.request_content_length` | Histogram | By | Measures the size of the HTTP request payload |
| `http.client.response_content_length` | Histogram | By | Measures the size of the HTTP response payload |

The dimensions that are useful for http.*.duration are useful in analyzing distributions of request / response size as well, so I would suggest extending the same set of attributes to these new metrics.

Arguably, this type of information fits into the io naming conventions, in which case we'd have metrics of the form http.*.io, with an additional attribute indicating whether the direction is request or response. However, I don't see any precedent for histogram io metrics, and in the case of http content lengths, the distribution is important.
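To illustrate why the distribution matters more than a running total here, below is a minimal sketch in plain Python. The bucket boundaries and the payload sizes are made up for illustration and are not taken from any specification; the point is only that two workloads with the same byte total can look very different as histograms.

```python
from bisect import bisect_left

# Hypothetical bucket upper bounds in bytes; not taken from any specification.
BOUNDS = [1_000, 10_000, 100_000, 1_000_000]


def bucket_counts(sizes, bounds=BOUNDS):
    """Count values per bucket; the last bucket holds values above bounds[-1]."""
    counts = [0] * (len(bounds) + 1)
    for size in sizes:
        # bisect_left finds the first bound >= size, so buckets are upper-inclusive.
        counts[bisect_left(bounds, size)] += 1
    return counts


# Two made-up workloads with the same total number of response bytes.
uniform = [50_000] * 10          # ten medium responses
skewed = [500] * 9 + [495_500]   # nine tiny responses and one large one

print(sum(uniform), sum(skewed))  # a sum-style io metric reports the same total
print(bucket_counts(uniform))     # all observations land in one bucket
print(bucket_counts(skewed))      # observations split between two distant buckets
```

A sum-based io metric cannot tell these two workloads apart, while the bucket counts can.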

If accepted, it would be natural to extend these same metrics to rpc.* as well.


tigrannajaryan · 2022-05-20T20:41:56Z

> It's very useful to have metrics about http request and response content length, for both http clients and servers.

Any reason to name http.server.request_content_* and not http.request_content_* if they are both for clients and servers?

jack-berg · 2022-05-20T21:45:36Z

> Any reason to name http.server.request_content_* and not http.request_content_* if they are both for clients and servers?

If you're running an http server that also calls downstream http servers, you need some way to differentiate between requests the server is handling, versus requests the server is producing as a client. The scope names of the client and server metrics will be different, but requiring users to be aware of the scope name is bad ergonomics IMO.

arminru · 2022-05-23T14:55:35Z

An alternative would be to encode client vs server in an attribute/dimension. For spans we have SpanKind as a dedicated property on the span. For metrics we could consider adding a generic attribute semantic convention (which might be set directly using a Convenience API) for specifying a "party" or "counterpart" kind.
The same idea was also brought up in the scope of RPC: #2419

jack-berg · 2022-05-23T15:13:06Z

Yeah I think these metrics should follow whatever conventions exist for http server / client metrics, and rpc server / client metrics. Right now, the difference between server and client is encoded in the metric name (i.e. http.server.duration and http.client.duration vs. rpc.server.duration and rpc.client.duration). If that pattern changes generally, these metrics should follow suit.

tigrannajaryan · 2022-05-24T13:17:26Z

> If you're running an http server that also calls downstream http servers, you need some way to differentiate between requests the server is handling, versus requests the server is producing as a client.

This is an interesting problem. For spans we solve this by requiring a server span and a client span for each of these operations. For metrics we don't have the exact equivalent notion (span kind). Perhaps this should be recorded as a dimension (e.g. direction=in/out)?

jack-berg · 2022-05-27T20:48:33Z

I think separate metric names are preferable to attributes because:

- It would be strange to want to analyze the aggregate of request / response bytes the service processed in its capacity as an http server with request / response bytes the service processed in its capacity as an http client.
- The instruments for server http metrics and client http metrics will almost certainly fall under different instrumentation scopes, since rarely does a library / framework work as both a client and server. So even if the metrics for server and client shared the same name, they're under different scopes and probably shouldn't be aggregated.
- After closer inspection, rpc already has metrics for this concept under names rpc.server.*.size and rpc.client.*.size. I've followed this convention in PR Add metrics for http request and response size #2588.
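As a rough illustration of the first point, here is a small plain-Python sketch with made-up payload sizes. The single-metric name `http.content_length` is hypothetical and exists only for comparison; the per-role names are the ones proposed in this issue.

```python
from collections import defaultdict
from statistics import mean

# Made-up payload sizes in bytes, keyed by (role, direction).
observations = {
    ("server", "request"): [200, 300, 250],
    ("server", "response"): [40_000, 60_000],
    ("client", "request"): [1_000, 1_200],
    ("client", "response"): [5_000_000],
}

# Option A: one metric name per role and direction, as proposed in this issue.
by_name = {
    f"http.{role}.{direction}_content_length": sizes
    for (role, direction), sizes in observations.items()
}

# Option B: a single hypothetical metric name, with role and direction carried
# as attributes. Aggregating those attributes away merges all four series.
by_attribute = defaultdict(list)
for sizes in observations.values():
    by_attribute["http.content_length"].extend(sizes)

print(mean(by_name["http.server.request_content_length"]))
print(mean(by_attribute["http.content_length"]))
```

With separate names each series stays meaningful on its own, whereas the merged series mixes small inbound requests with a large downstream response, which is the aggregate that seems hard to interpret.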

tigrannajaryan · 2022-05-30T13:33:15Z

> It would be strange to want to analyze the aggregate of request / response bytes the service processed in its capacity as an http server with request / response bytes the service processed in its capacity as an http client.

You mean aggregation over "direction" dimension would not be meaningful, right? I think I agree (in some cases perhaps the average between "in" and "out" could be considered somewhat meaningful as a "throughput" indication? I don't think it is very useful though).

We have a bunch of other metrics that have a direction dimension (e.g. read/write for disk metrics). I wonder if it is a mistake to have a dimension there as well. I submitted an issue to look into it: #2589

tigrannajaryan · 2022-05-30T13:39:33Z

> The instruments for server http metrics and client http metrics will almost certainly fall under different instrumentation scopes, since rarely does a library / framework work as both a client and server. So even if the metrics for server and client shared the same name, they're under different scopes and probably shouldn't be aggregated.

This is another interesting point. If we wanted to allow this then different instrumentations would need to create instruments with the same name but different Scope (different library name - server vs client), and each library would create data points only for the particular value of the attribute. I am not sure I understand what our API says about what happens in this case. It seems like we can't have the same Metric with different Scopes, so there is really no way to emit such data using our API.

jack-berg · 2022-06-01T19:34:41Z

> You mean aggregation over "direction" dimension would not be meaningful, right? I think I agree (in some cases perhaps the average between "in" and "out" could be considered somewhat meaningful as a "throughput" indication? I don't think it is very useful though).

In this case there are 4 directions: an http server has request and response sizes to track, and an http client has request and response sizes to track. I think it's unlikely (yet still possible) that you'd want to analyze http server request and response size together. The same applies to http client request and response size. However, I can't think of a scenario in which you'd want to analyze server AND client request and response size together.

> If we wanted to allow this then different instrumentations would need to create instruments with the same name but different Scope (different library name - server vs client), and each library would create data points only for the particular value of the attribute.

This is allowed. Instruments with the same name under different scopes are distinct and won't produce any conflict or error scenario. Relevant language from the api. Relevant language from the datamodel.

For the purpose of this conversation, this means that there is no benefit to giving server and client instruments the same name, since they'll be collected using different scopes, and will therefore be distinct instruments regardless of whether they share the same name.
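A toy model of that identity rule, written in plain Python. This is not the OpenTelemetry API or SDK; `ToyMeterProvider`, the scope names, and the shared instrument name are all hypothetical and only show how keying on (scope, name) keeps same-named instruments apart.

```python
class ToyMeterProvider:
    """Toy model only: instruments are identified by (scope name, instrument name)."""

    def __init__(self):
        self._instruments = {}

    def histogram(self, scope, name):
        # The same scope and name return the same recording list;
        # a different scope yields a separate one even if the name matches.
        return self._instruments.setdefault((scope, name), [])

    def streams(self):
        # Return a shallow copy of the (scope, name) -> recorded values mapping.
        return dict(self._instruments)


provider = ToyMeterProvider()

# Hypothetical scope names for a server framework and a client library.
server_hist = provider.histogram("example.server.framework", "http.request_content_length")
client_hist = provider.histogram("example.client.library", "http.request_content_length")

server_hist.append(512)
client_hist.append(2048)

for (scope, name), values in provider.streams().items():
    print(scope, name, values)
```

Under this model the two instruments never merge despite the shared name, so sharing the name buys nothing over `http.server.*` and `http.client.*`.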

tigrannajaryan · 2022-06-01T20:54:00Z

> For the purpose of this conversation, this means that there is no benefit to giving server and client instruments the same name, since they'll be collected using different scopes, and will therefore be distinct instruments regardless of whether they share the same name.

Thanks for confirming, that's what I wasn't sure about in our metric API.

jack-berg added the area:semantic-conventions and spec:metrics labels May 20, 2022

github-actions bot assigned tigrannajaryan May 20, 2022

arminru added the semconv:HTTP label May 23, 2022

jack-berg mentioned this issue May 27, 2022

Add metrics for http request and response size #2588

Merged

tigrannajaryan added the [label deprecated] triaged-accepted label Jun 21, 2022

carlosalberto closed this as completed in #2588 Jun 30, 2022
