API usability improvements in sampling package #31940

jmacd · 2024-03-25T17:19:22Z

Description:

Several usability issues were ironed out while working on #31894. This PR is the pkg/sampling changes from that PR.

Highlights:

Adds NeverSampleThreshold value, which is the exclusive upper-boundary of threshold values. This makes negative sampling decisions easier to manage, as shown in [probabilistic sampling processor] encoded sampling probability (support OTEP 235) #31894.
Adds AllProbabilitiesRandomness value, which is the inclusive upper-boundary of Randomness values. This makes error handling more natural as shown in [probabilistic sampling processor] encoded sampling probability (support OTEP 235) #31894. All thresholds except NeverSampleThreshold will be sampled at AllProbabilitiesRandomness.
Adds UnsignedToThreshold constructor for explicit threshold construction. This is useful in the case of [probabilistic sampling processor] encoded sampling probability (support OTEP 235) #31894 because it constructs a 14-bit threshold value.
Adds UnsignedToRandomness constructor for explicit randomness construction. This is useful in the case of [probabilistic sampling processor] encoded sampling probability (support OTEP 235) #31894 because it constructs randomness values from log records w/o use of TraceState.
Removes a parameter from UpdateTValueWithSampling to avoid the potential for an inconsistent update through mis-use (as identified in [probabilistic sampling processor] encoded sampling probability (support OTEP 235) #31894, there is less optimization potential because sampling.threshold modifies thresholds in all modes).
Eliminates the ErrPrecisionUnderflow error condition and automatically corrects the problem by extending precision near 0.0 and 1.0 where there are obligatory leading f or 0 digits.

Link to tracking Issue: #31918

Testing: New tests added for coverage.

Documentation: New comments to explain.

to iterate.

…tor-contrib into jmacd/tvaluesampler

…n#3602

…and resampler

…tor-contrib into jmacd/tvaluesampler

…ype similar to configcomprsesion.CompressionType

…tor-contrib into jmacd/tvaluesampler

jmacd · 2024-03-25T17:20:44Z

pkg/sampling/doc.go

-//
-//		return fixedThreshold.ShouldSample(rnd)
-//	}
+// func MakeDecision(tracestate string, tid TraceID) bool {


Note: the Go toolchain is reformatting this comment block for me. 🤷

jmacd · 2024-03-25T17:28:55Z

pkg/sampling/probability.go

+	// Calculate the amount of precision needed to encode the
+	// threshold with reasonable precision.  Here, we count the
+	// number of leading `0` or `f` characters and automatically
+	// add precision to preserve relative error near the extremes.
+	//
+	// Frexp() normalizes both the fraction and one-minus the
+	// fraction, because more digits of precision are needed if
+	// either value is near zero.  Frexp returns an exponent <= 0.
+	//
+	// If `exp <= -4`, there will be a leading hex `0` or `f`.
+	// For every multiple of -4, another leading `0` or `f`
+	// appears, so this raises precision accordingly.
+	_, expF := math.Frexp(fraction)
+	_, expR := math.Frexp(1 - fraction)
+	precision = min(NumHexDigits, max(precision+expF/-hexBits, precision+expR/-hexBits))
+
+	// Compute the threshold
+	scaled := uint64(math.Round(fraction * float64(MaxAdjustedCount)))
+	threshold := MaxAdjustedCount - scaled
+
+	// Round to the specified precision, if less than the maximum.
+	if shift := hexBits * (NumHexDigits - precision); shift != 0 {
+		half := uint64(1) << (shift - 1)
+		threshold += half
+		threshold >>= shift
+		threshold <<= shift


Note I removed ErrPrecisionUnderflow because it was not a meaningful error condition -- I would otherwise work around the problem. While investigating the head-sampler changes for OTel-Go I found a much nicer solution, copied here (with new testing).

jmacd · 2024-03-25T23:24:19Z

Reviewers: the new API surface is exercised in #31946.

github-actions · 2024-04-09T05:19:17Z

This PR was marked stale due to lack of activity. It will be closed in 14 days.

MovieStoreGuy · 2024-04-09T08:02:13Z

Sorry @jmacd for the long time waiting,

Just rerunning the build and hopefully we should be all good to merge this in.

…ation, prepare for OTEP 235 support (#31946) **Description:** Refactors the probabilistic sampling processor to prepare it for more OTEP 235 support. This clarifies existing inconsistencies between tracing and logging samplers, see the updated README. The tracing priority mechanism applies a 0% or 100% sampling override (e.g., "1" implies 100% sampling), whereas the logging sampling priority mechanism supports variable-probability override (e.g., "1" implies 1% sampling). This pins down cases where no randomness is available, and organizes the code to improve readability. A new type called `randomnessNamer` carries the randomness information (from the sampling pacakge) and a name of the policy that derived it. When sampling priority causes the effective sampling probability to change, the value "sampling.priority" replaces the source of randomness, which is currently limited to "trace_id_hash" or the name of the randomess-source attribute, for logs. While working on #31894, I discovered that some inputs fall through to the hash function with zero bytes of input randomness. The hash function, computed on an empty input (for logs) or on 16 bytes of zeros (which OTel calls an invalid trace ID), would produce a fixed random value. So, for example, when logs are sampled and there is no TraceID and there is no randomness attribute value, the result will be sampled at approximately 82.9% and above. In the refactored code, an error is returned when there is no input randomness. A new boolean configuration field determines the outcome when there is an error extracting randomness from an item of telemetry. By default, items of telemetry with errors will not pass through the sampler. When `FailClosed` is set to false, items of telemetry with errors will pass through the sampler. The original hash function, which uses 14 bits of information, is structured as an "acceptance threshold", ultimately the test for sampling translated into a positive decision when `Randomness < AcceptThreshold`. In the OTEP 235 scheme, thresholds are rejection thresholds--this PR modifies the original 14-bit accept threshold into a 56-bit reject threshold, using Threshold and Randomness types from the sampling package. Reframed in this way, in the subsequent PR (i.e., #31894) the effective sampling probability will be seamlessly conveyed using OTEP 235 semantic conventions. Note, both traces and logs processors are now reduced to a function like this: ``` return commonSamplingLogic( ctx, l, lsp.sampler, lsp.failClosed, lsp.sampler.randomnessFromLogRecord, lsp.priorityFunc, "logs sampler", lsp.logger, ) ``` which is a generic function that handles the common logic on a per-item basis and ends in a single metric event. This structure makes it clear how traces and logs are processed differently and have different prioritization schemes, currently. This structure also makes it easy to introduce new sampler modes, as shown in #31894. After this and #31940 merge, the changes in #31894 will be relatively simple to review as the third part in a series. **Link to tracking Issue:** Depends on #31940. Part of #31918. **Testing:** Added. Existing tests already cover the exact random behavior of the current hashing mechanism. Even more testing will be introduced with the last step of this series. Note that #32360 is added ahead of this test to ensure refactoring does not change results. **Documentation:** Added. --------- Co-authored-by: Kent Quirk <kentquirk@gmail.com>

jmacd added 30 commits May 12, 2023 15:20

Add t-value sampler draft

e822a9b

copy/import tracestate parser package

1bc6017

test ot tracestate

d1fd891

tidy

85e4472

renames

bb75f8a

testing two parsers w/ generic code

6a57b77

integrated

7fa8130

Comments

36230e7

revert two files

7bae35c

Update with r, s, and t-value. Now using regexps and strings.IndexByte()

9010a67

to iterate.

fix sampler build

0e27e40

add support for s-value for non-consistent mode

efcdc3d

WIP

939c758

Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…

b9a1e56

…tor-contrib into jmacd/tvaluesampler

use new proposed syntax see open-telemetry/opentelemetry-specificatio…

a31266c

…n#3602

update tracestate libs for new encoding

690cd64

wip working on probabilistic sampler with two new modes: downsampler …

c8baf29

…and resampler

unsigned implement split

7f47e4a

two implementations

422e0b2

wip

787b9fd

Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…

ed36f03

…tor-contrib into jmacd/tvaluesampler

Updates for OTEP 235

d795210

wip TODO

09000f7

versions.yaml

a4d467b

Add proportional sampler mode; comment on TODOs; create SamplerMode t…

e373b9b

…ype similar to configcomprsesion.CompressionType

back from internal

fe6a085

wip

396efb1

fix existing tests

36de5dd

:wip:

f1aa0ad

Update for rejection threshold

700734e

jmacd added 6 commits March 25, 2024 08:52

Merge branch 'main' of github.com:open-telemetry/opentelemetry-collec…

fdd26ac

…tor-contrib into jmacd/tvaluesampler

revert these

5097f43

Remove new files too

aaff323

comments

dd979b9

chlog

34004a2

add tests

f202a57

github-actions bot added the pkg/sampling label Mar 25, 2024

github-actions bot requested a review from kentquirk March 25, 2024 17:20

jmacd commented Mar 25, 2024

View reviewed changes

comment

d401076

jmacd commented Mar 25, 2024

View reviewed changes

jmacd marked this pull request as ready for review March 25, 2024 17:31

jmacd requested review from a team and mx-psi March 25, 2024 17:31

github-actions bot assigned MovieStoreGuy Mar 25, 2024

fix that

2e41e69

jmacd mentioned this pull request Mar 25, 2024

Refactor the probabilistic sampler processor; add FailClosed configuration, prepare for OTEP 235 support #31946

Merged

github-actions bot added the Stale label Apr 9, 2024

MovieStoreGuy removed the Stale label Apr 9, 2024

MovieStoreGuy approved these changes Apr 9, 2024

View reviewed changes

Merge branch 'main' into jmacd/pkgsamplingup

eddffa8

MovieStoreGuy merged commit beef35e into open-telemetry:main Apr 9, 2024
169 of 170 checks passed

github-actions bot added this to the next release milestone Apr 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API usability improvements in sampling package #31940

API usability improvements in sampling package #31940

jmacd commented Mar 25, 2024 •

edited

Loading

jmacd Mar 25, 2024

jmacd Mar 25, 2024

jmacd commented Mar 25, 2024

github-actions bot commented Apr 9, 2024

MovieStoreGuy commented Apr 9, 2024

API usability improvements in sampling package #31940

API usability improvements in sampling package #31940

Conversation

jmacd commented Mar 25, 2024 • edited Loading

jmacd Mar 25, 2024

Choose a reason for hiding this comment

jmacd Mar 25, 2024

Choose a reason for hiding this comment

jmacd commented Mar 25, 2024

github-actions bot commented Apr 9, 2024

MovieStoreGuy commented Apr 9, 2024

jmacd commented Mar 25, 2024 •

edited

Loading