
Include the Tail Sampling Processor #1229

Closed
seh opened this issue Mar 27, 2024 · 6 comments
Labels
enhancement New feature or request

Comments

@seh

seh commented Mar 27, 2024

Motivating Problem

When running the OpenTelemetry Collector alongside a Lambda function, it is difficult to coordinate running separate tail sampling proxies like Honeycomb Refinery, for lack of a suitable container orchestration system being available. It would be beneficial to perform tail sampling within the OpenTelemetry Collector running alongside each Lambda function as well.

Proposed Solution

Make the Tail Sampling Processor available as a processor in the OpenTelemetry Collector Lambda layer's build.
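
If the processor were compiled into the layer, enabling it would then only be a matter of collector configuration for the function. A minimal sketch, assuming the standard `tail_sampling` processor from opentelemetry-collector-contrib and an OTLP receiver/exporter pair like those in a typical layer configuration (the receiver, exporter, and policy names here are illustrative):

```yaml
processors:
  tail_sampling:
    decision_wait: 5s     # how long to buffer spans before deciding on a trace
    num_traces: 10000     # maximum number of traces held in memory while waiting
    policies:
      - name: placeholder-keep-everything
        type: always_sample

service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [tail_sampling]
      exporters: [otlphttp]
```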

Alternatives Considered

Set up a Kubernetes or ECS cluster to run Honeycomb Refinery, and export traces from each OpenTelemetry Collector to Refinery for tail sampling. However, for a system that relies solely on AWS Lambda functions, establishing that separate environment within which to run Refinery is difficult.

Additional Context

I don't know how much adding this new processor would increase the size of the built executable file.

@seh added the enhancement label on Mar 27, 2024
@tylerbenson
Member

@seh any interest in contributing a PR for this?

@seh
Author

seh commented May 4, 2024

> @seh any interest in contributing a PR for this?

Yes, though it would help to have an example of a similar precedent, both to get started and to gauge how large such a patch is likely to become. Do any other components introduced like this come to mind?

@tylerbenson
Member

Maybe a combination of #959 and #1046?

@serkan-ozal
Contributor

@seh @tylerbenson

In the tail sampling processor, spans with the same trace ID are expected to reach the same collector instance, because the sampling decision is made at the end of the trace over all of its spans. When spans from the same trace go to different collector instances, the tail sampling processor will not work properly, and there will likely be broken traces because only some of a trace's spans get sampled.

In our case, since different Lambda functions and instances can be part of the same trace (flow), exporting their spans to different Lambda collectors for tail sampling will be problematic.

So I am not sure how we expect the tail sampling processor to work in a Lambda collector instance. Am I missing anything here?

@seh
Author

seh commented Sep 1, 2024

I expect that you're correct here. By analogy, or maybe as prior art, Honeycomb's Refinery servers coordinate "ownership" of traces by tracking the peer count and identity via Redis. Without coordination like that, lots of separate collectors each making independent sampling decisions wouldn't work very well.

For my case, though, I expect the "entire" trace—at least the spans pertinent to this part of our system—to come from a given Lambda function instance, such that I could still make simple local sampling decisions usefully, such as "keep every trace with an error but keep no more than one of twenty of the rest".
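
That kind of purely local policy maps directly onto the processor's existing policy types. A sketch of what "keep every trace with an error, plus roughly one in twenty of the rest" could look like (policy names and the exact percentage are illustrative):

```yaml
processors:
  tail_sampling:
    decision_wait: 5s
    policies:
      # A trace is kept if any policy matches, so error traces are always retained.
      - name: keep-errors
        type: status_code
        status_code:
          status_codes: [ERROR]
      # Non-error traces fall through to a probabilistic policy.
      - name: keep-one-in-twenty
        type: probabilistic
        probabilistic:
          sampling_percentage: 5   # roughly 1 in 20
```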

@tylerbenson
Member

Reasonable argument. Going to close this, but feel free to reopen if you have another idea for how to proceed.

@tylerbenson closed this as not planned on Sep 4, 2024