Exemplars prototype #936

cnnradams · 2020-07-23T20:01:02Z

This PR introduces exemplars into the SDK (for details on exemplars, see OTEP #113)

These change are (of course) experimental, as no major backend supports "statistical" exemplars and there is no concrete SDK spec for exemplars yet (only an OTEP). So any feedback is very much appreciated :)

Built off of the views API prototype: #596

jmacd

This looks great!

My approval here should carry only so much weight because I'm not very familiar with the Python SDK but this code generally looks readable and feels familiar. I have been thinking about how precisely I would integrate sampling in the SDK, and came up with two possibilities. One of the possibilities is to implement exemplars in the Aggregators themselves, as you've done, an d another possibility is to implement something in the Accumulator that might support exemplars in a less Aggregator-dependent way. Of course, you're also showing why we want the Aggregator to see the Update, so that Histograms can select exemplars by bucket.

To get this behavior w/ the first approach is straight forward, though it requires the Aggregator API to receive the excluded labels (as you have done), whereas up until now the Aggregators only know about numbers and math--labels are orthogonal to Aggregator state.

I thought about the other possibility (that exemplars can be Accumulator functionality) because it would be nice if we could compose something with any other aggregator to make this work. Imagine a class, named ExemplarAggregator, that can be used to select Aggregators in parallel any other aggregation (e.g., Histogram, Exact, ...). This is less straight forward, because now there's some kind of dependency between exemplar selection and an independent aggregator, and that's why I'm approving your approach.

For the record though, I put up a draft PR that begins this other approach here: open-telemetry/opentelemetry-go#1023. The idea is that the Accumulator can be told which dimensions to include (thus which to exclude) for each metric instrument, and that it therefore knows when to call the sampling manager. See: https://github.com/open-telemetry/opentelemetry-go/pull/1023/files#diff-4c3de179542bac3c5ceacd2090215160R225. I haven't worked out the details.

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py

lzchen · 2020-08-04T16:54:44Z

docs/examples/exemplars/README.rst

+    - Exemplars will be picked to represent the input distribution, without unquantifiable bias
+    - A "sample_count" attribute will be set on each exemplar to quantify how many measurements each exemplar represents
+
+See 'statistical_exemplars.ipynb' for the example (TODO: how do I link this?)


Are you not able to just link to the python file instead?

You can also do something like this if you want it inlined in the docs.

.. literalinclude:: basic_trace.py :language: python :lines: 1-

did literalinclude of the python examples, would have preferred a link to the jupyter notebook but this will work for now

lzchen · 2020-08-04T16:55:44Z

docs/examples/exemplars/semantic_exemplars.py

@@ -0,0 +1,84 @@
+# Copyright The OpenTelemetry Authors


Nit: Can we rename basic_meter to metrics and put this folder under that in the examples?

docs/examples/exemplars/README.rst

docs/examples/exemplars/statistical_exemplars.py

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py

aabmass · 2020-08-06T19:26:07Z

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py

+            self.sample_set[replace_index] = exemplar
+
+    def merge(self, set1, set2):
+        combined = set1 + set2


Looking at the algorithm Josh linked, I don't think merging like this would give a uniform sample. In the extreme case, imagine k = 1 and you have two samplers; sampler1's rand_count = 1 (population size of 1) and sampler2 finishes with rand_count = 10. sampler1's single value would be weighted much more strongly than any of the values sampler2 saw.

E.g. sampler1 see value [0], sampler2 sees values [1, 2, 3, ..., 10] but keeps only k = 1 sample, say, [6]. Randomly sampling k = 1 values from [0, 6] would give [0] 50% of the time, even though it was one of 11 values sampled in the whole population.

I think there is a similar issue when arg_count > k. The wiki page says each item is sampled with probability k/n; if the samplers being merged saw different n, then you would end up giving more weight to the sampler with smaller n.

as discussed offline, just keeping the second set's exemplars with that assumption based on the current SDK implementation. Might need to change in the future but for now that keeps this much simpler

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py

aabmass · 2020-08-06T19:48:30Z

I love the Jupyter Notebook! I think it would be awesome to add nbsphinx to the docs setup so you could include the notebook directly in the docs, if others are cool with that.

cnnradams · 2020-08-06T19:50:05Z

I love the Jupyter Notebook! I think it would be awesome to add nbsphinx to the docs setup so you could include the notebook directly in the docs, if others are cool with that.

I actually tried to use nbsphinx, and found that the notebook was awful looking inline, due to its margins being the full page but the docs being only half a page. Maybe if I put it on its own page it would be different?

aabmass · 2020-08-06T19:52:07Z

I actually tried to use nbsphinx, and found that the notebook was awful looking inline, due to its margins being the full page but the docs being only half a page. Maybe if I put it on its own page it would be different?

Ya, not too sure, ive never used it myself. If nothing else works, you could probably render it to HTML then link or embed that somehow?

aabmass

A few more small nits

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py

aabmass · 2020-08-07T15:55:42Z

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py

+
+    def __init__(
+        self,
+        config: dict,


If you know the exact keys and types, try a TypedDict

We don't know the exact types - it may change depending on what aggregators need to know

You mean for custom aggregators?

the values already used are int, str, bool, list, and there is no restriction on passing in something like a float to the config dict

I don't follow you. Are you saying you would use the same key with different value types depending on the aggregator?

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py

jmacd · 2020-08-08T03:56:40Z

I left a lengthy remark on this topic here:
open-telemetry/opentelemetry-specification#617 (comment)

cnnradams · 2020-11-06T09:07:50Z

Closing for now.

cnnradams requested a review from a team July 23, 2020 20:01

cnnradams force-pushed the exemplars branch 6 times, most recently from ab2c1fa to 0be0964 Compare July 24, 2020 14:43

cnnradams changed the title ~~WIP: Exemplars prototype~~ Exemplars prototype Jul 27, 2020

cnnradams force-pushed the exemplars branch from 85d17da to ac8da0f Compare July 28, 2020 19:10

jmacd approved these changes Aug 4, 2020

View reviewed changes

opentelemetry-sdk/src/opentelemetry/sdk/metrics/export/exemplars.py Show resolved Hide resolved

cnnradams force-pushed the exemplars branch 2 times, most recently from 142c4ea to 654f185 Compare August 4, 2020 14:11

cnnradams changed the base branch from views to master August 4, 2020 14:14

cnnradams force-pushed the exemplars branch from 654f185 to 1aa84b4 Compare August 4, 2020 15:19

lzchen reviewed Aug 4, 2020

View reviewed changes

aabmass reviewed Aug 6, 2020

View reviewed changes

cnnradams force-pushed the exemplars branch from 9845445 to 97778a0 Compare August 6, 2020 23:21

cnnradams added 4 commits August 6, 2020 19:21

Integrate Exemplars with Python SDK

9b59927

linting

c2727b4

semantic -> trace, link to wiki

f3ed3f3

fixes

a404705

cnnradams force-pushed the exemplars branch from 97778a0 to a404705 Compare August 6, 2020 23:23

readme

b158e59

aabmass reviewed Aug 7, 2020

View reviewed changes

cnnradams force-pushed the exemplars branch 2 times, most recently from bc1b06b to e4e8c20 Compare August 7, 2020 17:13

cnnradams force-pushed the exemplars branch from e4e8c20 to 0bb8693 Compare August 7, 2020 17:16

nits

bb2e302

cnnradams force-pushed the exemplars branch from 0bb8693 to bb2e302 Compare August 7, 2020 17:20

codeboten assigned aabmass Sep 24, 2020

cnnradams closed this Nov 6, 2020

fcollonval mentioned this pull request Jul 28, 2024

Metrics: Add support for exemplars #2407

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exemplars prototype #936

Exemplars prototype #936

cnnradams commented Jul 23, 2020

jmacd left a comment

lzchen Aug 4, 2020 •

edited

Loading

aabmass Aug 6, 2020

cnnradams Aug 7, 2020

lzchen Aug 4, 2020

aabmass Aug 6, 2020

cnnradams Aug 6, 2020

aabmass commented Aug 6, 2020

cnnradams commented Aug 6, 2020

aabmass commented Aug 6, 2020

aabmass left a comment

aabmass Aug 7, 2020

cnnradams Aug 7, 2020

aabmass Aug 7, 2020

cnnradams Aug 7, 2020

aabmass Aug 7, 2020

jmacd commented Aug 8, 2020

cnnradams commented Nov 6, 2020

Exemplars prototype #936

Exemplars prototype #936

Conversation

cnnradams commented Jul 23, 2020

jmacd left a comment

Choose a reason for hiding this comment

lzchen Aug 4, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aabmass commented Aug 6, 2020

cnnradams commented Aug 6, 2020

aabmass commented Aug 6, 2020

aabmass left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jmacd commented Aug 8, 2020

cnnradams commented Nov 6, 2020

lzchen Aug 4, 2020 •

edited

Loading