sklearn instrumentation package #1054

crflynn · 2020-08-30T23:31:36Z

Description

Provides an opentelemetry instrumentation package for sklearn models, instrumenting internal spans at the estimator level. The motivation is to provide observability into machine learning models that run for realtime predictive applications that have many complex transformers and predictors.

The instrumentor adds spans to sklearn estimators according to a set of default estimator methods (namely fit, predict, predict_proba and transform) and other configuration parameters that determine how spans are implemented through the model hierarchy. The default configuration also handles Pipeline and FeatureUnion hierarchies. Since sklearn's API is easily extended, the configuration parameters allow for custom model hierarchy traversal, allowing spans to be implemented in custom estimators as well.

Type of change

New feature (non-breaking change which adds functionality)
This change requires a documentation update

How Has This Been Tested?

The package provides two tests for the implementation.

test_span_properties uses an sklearn model fixture and asserts span names, kinds, and parent-child relationships.
test_attrib_config uses the same fixture to assert implementation of non-default configuration parameters.

I also have an example implementation here: https://github.com/crflynn/opentelemetry-sklearn

Checklist:

Followed the style guidelines of this project
Changelogs have been updated
Unit tests have been added
Documentation has been updated

linux-foundation-easycla · 2020-08-30T23:31:39Z

The committers are authorized under a signed CLA.

✅ Flynn (1f4cb32, 5302647)

codeboten · 2020-09-22T21:49:30Z

hello and welcome @crflynn! Please sign the CLA, I'll review afterwards.

instrumentation/opentelemetry-instrumentation-sklearn/CHANGELOG.md

.../opentelemetry-instrumentation-sklearn/src/opentelemetry/instrumentation/sklearn/__init__.py

crflynn · 2020-09-28T04:32:21Z

I'm not sure how to get the docs to build. Sphinx doesn't seem to want to cooperate with the type hints I've provided. I have a nitpick_ignore for the sklearn BaseEstimator but that seems to not be enough.

ocelotl

I'm not sure how to get the docs to build. Sphinx doesn't seem to want to cooperate with the type hints I've provided. I have a nitpick_ignore for the sklearn BaseEstimator but that seems to not be enough.

I looked into this issue, apparently sphinx can't find a class for some reason. It seems like a similar issue as this one, which has a workaround here. Nevertheless, I tried it and it did not solve the documentation problem. Apparently the root cause between this issue is not the same as the one the workaround is for, since sklearn.base.BaseEstimator is not being imported into any other module in the sklearn for Sphinx to import from this new location. Will look further into this.

ocelotl · 2020-10-01T23:18:55Z

instrumentation/opentelemetry-instrumentation-sklearn/tests/test_sklearn.py

+
+class TestSklearn(TestBase):
+    def test_package_instrumentation(self):
+        ski = SklearnInstrumentor(packages=["sklearn"])


Is it necessary to pass ["sklearn"] to the constructor here? I am under the impression that this is done by default because of this.

It's not necessary, no. I can remove it.

.../opentelemetry-instrumentation-sklearn/src/opentelemetry/instrumentation/sklearn/__init__.py

ocelotl · 2020-10-01T23:43:02Z

.../opentelemetry-instrumentation-sklearn/src/opentelemetry/instrumentation/sklearn/__init__.py

+
+        The base class' new method passes args and kwargs. We override because
+        we init the class with configuration and Python raises TypeError when
+        additional arguments are passed to the object.__new__() method.


This seems like an issue in the base class implementation of the singleton mechanism, actually... will have to look into this.

ocelotl · 2020-10-01T23:50:18Z

.../opentelemetry-instrumentation-sklearn/src/opentelemetry/instrumentation/sklearn/__init__.py

+                for method_name in self.methods:
+                    if hasattr(klass, method_name):
+                        self._instrument_class_method(
+                            estimator=klass, method_name=method_name


Just a concern, what if there is another instrumentation also installed for the same package that is passed as an argument for this instrumentor? Would that cause double instrumentation?

The only way it wouldn't is if that other instrumentation package used the same strategy, i.e. resulted in a True return value from the _check_instrumented method. Order of instrumentation might also matter in that regard.

Related, one of the things I tried to do here was abstract the span decorator as an instrumentor arg with the idea that multiple instrumentations (say otel + datadog) could be applied to estimators by just instantiating multiple instrumentors with different decorators. However, this isn't feasible with the current code because of how instrumentation is applied with the _original_xxx attributes and the _check_instrumented method.

crflynn · 2020-10-02T03:29:05Z

The challenge comes from this decorator which delegates attributes to sub-estimators. I've gotten this to work for instances of estimators but the current iteration doesn't quite work on patching classes. So there is still a bit of work on the package instrumentation side.

After iterating for a few days I'm a bit stuck on instrumenting the library. Particularly I'm having a hard time patching methods which are class attributes (rather than instance attributes) because of the if_delegate_has_method decorator which exists on some metaestimators. This decorator acts as a conditional property which delegates if and only specific instance attributes exist.

The problem is here where obj is None. Instances created from the class with the patched method will attempt to call the lambda returned by the descriptor, but with the first argument being None, resulting in a call with a bad signature.

>   out = lambda *args, **kwargs: self.fn(obj, *args, **kwargs)
E   TypeError: predict() takes 2 positional arguments but 3 were given

I could omit these methods, as I do for properties, but I believe spanning these methods is important because they delegate to other internal estimators and it would obfuscate some of the model hierarchy if they were missing.

NathanielRN · 2020-10-22T04:45:17Z

Hello! I have a PR to move some files you have in this PR to the Contrib repo, please let me know if this gets merged before the PR in the Contrib repo. Please see https://github.com/open-telemetry/opentelemetry-python-contrib/pulls/

lzchen · 2020-10-22T14:51:14Z

@crflynn

I could omit these methods, as I do for properties, but I believe spanning these methods is important because they delegate to other internal estimators and it would obfuscate some of the model hierarchy if they were missing.

Is there value in having the per model instrumentation way have all the methods patched, but having the whole package instrumentation omit some of these methods? We can just update our documentations to reflect this. This way at least we have functionality for both.

crflynn · 2020-10-29T01:28:34Z

I've got a solution for the autoinstrumentation delegation problems in the latest push, which passes the tests locally. It seems though that

sphinx still doesn't like BaseEstimator as an arg type in docstrings
otel 0.14.dev0 packages has been removed/replaced and won't install (just for sklearn tests?)
instrumentation looks like it is being moved to a separate repo

I think this is a lot closer now. Let me know how we should go from here.

instrumentation/opentelemetry-instrumentation-sklearn/CHANGELOG.md

.../opentelemetry-instrumentation-sklearn/src/opentelemetry/instrumentation/sklearn/__init__.py

lzchen

Nice! Thanks for the contrib.

lzchen · 2020-11-05T15:26:42Z

@NathanielRN

This PR probably going to be affected the migration.

crflynn · 2020-11-05T15:50:32Z

Should I just move this over to the contrib repo?

NathanielRN · 2020-11-05T15:51:44Z

@crflynn Yes please! That would be super helpful. The contrib repo is ready to accept new PRs like these :)

crflynn · 2020-11-05T16:27:19Z

@NathanielRN I'll work on that later today

codeboten · 2020-11-05T16:36:55Z

@ocelotl can you review again, would like to get this merged

lzchen · 2020-11-05T18:28:01Z

@crflynn
Hey thanks for agreeing to move this to the contrib repo. I'll be closing this PR for now.
@ocelotl Once the new PR is created, you can put your review there.

crflynn requested a review from a team August 30, 2020 23:31

crflynn force-pushed the opentelemetry-sklearn branch from 39a8af0 to 342c88c Compare August 31, 2020 00:37

codeboten added the instrumentation Related to the instrumentation of third party libraries or frameworks label Sep 3, 2020

sklearn instrumentation

1f4cb32

crflynn force-pushed the opentelemetry-sklearn branch from 342c88c to 1f4cb32 Compare September 5, 2020 13:10

crflynn added 5 commits September 22, 2020 21:13

more flexible instrumentation

5302647

Merge branch 'master' into opentelemetry-sklearn

42e256f

update deps

8fb526a

update package ver

32af450

fix docstring

b6461fd

lzchen reviewed Sep 23, 2020

View reviewed changes

instrumentation/opentelemetry-instrumentation-sklearn/CHANGELOG.md Outdated Show resolved Hide resolved

lzchen reviewed Sep 23, 2020

View reviewed changes

.../opentelemetry-instrumentation-sklearn/src/opentelemetry/instrumentation/sklearn/__init__.py Show resolved Hide resolved

lzchen reviewed Sep 23, 2020

View reviewed changes

.../opentelemetry-instrumentation-sklearn/src/opentelemetry/instrumentation/sklearn/__init__.py Outdated Show resolved Hide resolved

crflynn added 3 commits September 23, 2020 20:22

remove changelog entry

33d1081

update for autoinstrumentation and uninstrumentation

63bd6cd

rm f-strings

2e330eb

ocelotl reviewed Oct 1, 2020

View reviewed changes

update package instrumentation test

07b024a

format

9c55e99

lzchen assigned lzchen and ocelotl Oct 15, 2020

handle instrumentation of delegators

de241d3

crflynn force-pushed the opentelemetry-sklearn branch from d90763f to de241d3 Compare October 29, 2020 01:10