Implement new stable URL semantic conventions #8491

mateuszrzeszutek · 2023-05-15T12:46:43Z

... and use them as part of the HTTP extractors - the HTTP getters now extend the UrlAttributesGetter, and override the methods that HTTP semconv is interested in.

This PR also sets up the OTEL_SEMCONV_STABILITY_OPT_IN env var handling, and adds some tests that cover all 3 possible variants of emitted attribute sets.

...va/io/opentelemetry/instrumentation/api/instrumenter/http/HttpServerAttributesExtractor.java

trask · 2023-05-16T23:10:48Z

...src/main/java/io/opentelemetry/instrumentation/api/instrumenter/url/UrlAttributesGetter.java

+  @Nullable
+  default String getFullUrl(REQUEST request) {
+    return null;
+  }
+
+  /**
+   * Returns the <a href="https://www.rfc-editor.org/rfc/rfc3986#section-3.1">URI scheme</a>
+   * component identifying the used protocol.
+   *
+   * <p>Examples: {@code https}, {@code ftp}, {@code telnet}
+   */
+  @Nullable
+  default String getUrlScheme(REQUEST request) {
+    return null;
+  }


I think I slightly prefer consistency with other *Getter classes of not repeating Url in the method names, e.g.

getFull()

getScheme()

etc

That was my initial idea to.

But, the problem with the HTTP semconv is, it uses other semconv (URL, client, server, network) and slightly overrides how some of the "inherited" attributes are extracted. For example, client.address will be extracted from the Forwarded header if not provided; or server.port will be taken from the Host header, but only if it's not 80/443.

To apply these customizations we either have to pass multiple getters to the extractor (which is what we're doing right now; in the new semconv it would be 5 getters instead of 2 though), or have the HTTP getter extend the "inherited" semconv getters - in which case, something likegetFull() appearing on an HTTP getter would be mildly confusing to the user. Which is why I've chosen to be a bit more verbose here.

...va/io/opentelemetry/instrumentation/api/instrumenter/http/HttpServerAttributesExtractor.java

brunobat · 2023-05-25T15:59:08Z

...src/main/java/io/opentelemetry/instrumentation/api/instrumenter/url/UrlAttributesGetter.java

+   * <p>Examples: {@code /search}
+   */
+  @Nullable
+  default String getUrlPath(REQUEST request) {


Should we also have a target? a low cardinality attribute for things like:
`/search/{something}

For HTTP semconv, http.route contains that kind of information.

In general, these classes are meant to reflect the semantic conventions -- if you want to introduce a new attribute, you might want to start there.

mateuszrzeszutek · 2023-05-26T15:38:25Z

Some notes from the meeting:

The server.*, client.* and network.* namespaces will get their respective getters (and extractors); the HTTP getters will extend them;
Other instrumentations (e.g. RPC, or the upcoming Elasticsearch instrumentation) could simply use the UrlAttributesExtractor/NetworkAttributesExtractor/etc;
We could remove the getFullUrl() method from UrlAttributesGetter and move it entirely to the HttpClientAttributesGetter; this would remove some confusion about attributes that are not really supposed to be emitted by the HTTP client instrumentations but still available in the getter;
The UrlAttributesGetter would be left with just the URL components, without the full URL;
Every getter method would include the full name of the attribute, e.g. getUrlFull, getHttpRequestMethod etc.

trask · 2023-06-02T03:54:32Z

...ntelemetry/instrumentation/api/instrumenter/url/internal/InternalUrlAttributesExtractor.java

+    if (path == null && query == null) {
+      return null;
+    }
+    return (path == null ? "" : path) + (query == null || query.isEmpty() ? "" : "?" + query);


I think this would better handle http://xyz/path? (also related open-telemetry/opentelemetry-java#5501)

Suggested change

return (path == null ? "" : path) + (query == null || query.isEmpty() ? "" : "?" + query);

return (path == null ? "" : path) + (query == null || "?" + query);

I found that some HTTP servers always return a non-null value for query, even if the called URL does not contain the query component.
I'll revert to the old behavior and merge this PR; let's discuss if we want to change this behavior in the old semconv separately (I made the new url.* extractors always add a non-null value for query)

This reverts commit f74dbce.

mateuszrzeszutek requested a review from a team May 15, 2023 12:46

mateuszrzeszutek commented May 15, 2023

View reviewed changes

...va/io/opentelemetry/instrumentation/api/instrumenter/http/HttpServerAttributesExtractor.java Outdated Show resolved Hide resolved

trask reviewed May 16, 2023

View reviewed changes

mateuszrzeszutek force-pushed the url-getter branch from 4e41510 to 07fa74f Compare May 17, 2023 10:30

brunobat reviewed May 25, 2023

View reviewed changes

mateuszrzeszutek force-pushed the url-getter branch 2 times, most recently from 56bc048 to 11d7bc3 Compare May 31, 2023 12:13

This was referenced May 31, 2023

Implement new stable network semantic conventions #8616

Merged

Implement new stable HTTP semantic conventions #8632

Merged

trask approved these changes Jun 2, 2023

View reviewed changes

mateuszrzeszutek enabled auto-merge (squash) June 5, 2023 08:50

Mateusz Rzeszutek added 6 commits June 5, 2023 12:06

Implement new stable URL semantic conventions

4f3ea4e

spotless

ad22c2f

add a new otel.instrumentation.http.prefer-forwarded-url-scheme setting

451cad6

Move getFullUrl() to HttpClientAttributesGetter

741923f

empty url.query

baaef92

Revert "empty url.query"

7c76d2b

This reverts commit f74dbce.

mateuszrzeszutek force-pushed the url-getter branch from 6300950 to 7c76d2b Compare June 5, 2023 10:06

mateuszrzeszutek merged commit 8ee63d4 into open-telemetry:main Jun 5, 2023

mateuszrzeszutek deleted the url-getter branch June 5, 2023 18:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement new stable URL semantic conventions #8491

Implement new stable URL semantic conventions #8491

mateuszrzeszutek commented May 15, 2023

trask May 16, 2023

mateuszrzeszutek May 17, 2023

brunobat May 25, 2023

mateuszrzeszutek May 25, 2023

mateuszrzeszutek commented May 26, 2023

trask Jun 2, 2023

mateuszrzeszutek Jun 5, 2023

	return (path == null ? "" : path) + (query == null \|\| query.isEmpty() ? "" : "?" + query);
	return (path == null ? "" : path) + (query == null \|\| "?" + query);

Implement new stable URL semantic conventions #8491

Implement new stable URL semantic conventions #8491

Conversation

mateuszrzeszutek commented May 15, 2023

trask May 16, 2023

Choose a reason for hiding this comment

mateuszrzeszutek May 17, 2023

Choose a reason for hiding this comment

brunobat May 25, 2023

Choose a reason for hiding this comment

mateuszrzeszutek May 25, 2023

Choose a reason for hiding this comment

mateuszrzeszutek commented May 26, 2023

trask Jun 2, 2023

Choose a reason for hiding this comment

mateuszrzeszutek Jun 5, 2023

Choose a reason for hiding this comment