Add support for Doctrine auto-instrumentation #300

DominicDetta · 2024-10-03T07:05:38Z

No description provided.

…m errors

linux-foundation-easycla · 2024-10-03T07:05:44Z

The committers listed above are authorized under a signed CLA.

✅ login: DominicDetta / name: domaz (046e810, 2e6c142, 32d028b, 011365e, 5feb61d, 42e52c4)

codecov · 2024-10-03T07:40:54Z

Codecov Report

Attention: Patch coverage is 96.39640% with 4 lines in your changes missing coverage. Please review.

Project coverage is 79.51%. Comparing base (c06f8c6) to head (32d028b).
Report is 2 commits behind head on main.

Files with missing lines	Patch %	Lines
...mentation/Doctrine/src/DoctrineInstrumentation.php	96.39%	4 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##               main     #300      +/-   ##
============================================
- Coverage     80.35%   79.51%   -0.84%     
+ Complexity     1025      626     -399     
============================================
  Files            98       68      -30     
  Lines          4112     2738    -1374     
============================================
- Hits           3304     2177    -1127     
+ Misses          808      561     -247

Flag	Coverage Δ
Aws	`85.51% <ø> (-0.24%)`	⬇️
Context/Swoole	`?`
Instrumentation/CakePHP	`20.00% <ø> (ø)`
Instrumentation/CodeIgniter	`73.94% <ø> (ø)`
Instrumentation/Doctrine	`96.39% <96.39%> (?)`
Instrumentation/ExtAmqp	`89.58% <ø> (ø)`
Instrumentation/Guzzle	`69.73% <ø> (ø)`
Instrumentation/HttpAsyncClient	`81.33% <ø> (ø)`
Instrumentation/IO	`70.90% <ø> (ø)`
Instrumentation/MongoDB	`77.33% <ø> (ø)`
Instrumentation/OpenAIPHP	`86.82% <ø> (ø)`
Instrumentation/PDO	`89.56% <ø> (ø)`
Instrumentation/Psr14	`78.12% <ø> (ø)`
Instrumentation/Psr15	`93.50% <ø> (ø)`
Instrumentation/Psr16	`97.50% <ø> (ø)`
Instrumentation/Psr18	`82.08% <ø> (ø)`
Instrumentation/Psr3	`?`
Instrumentation/Psr6	`97.61% <ø> (ø)`
Instrumentation/Slim	`86.95% <ø> (ø)`
Instrumentation/Symfony	`89.07% <ø> (+0.03%)`	⬆️
Instrumentation/Yii	`77.77% <ø> (ø)`
Logs/Monolog	`?`
Propagation/ServerTiming	`100.00% <ø> (ø)`
Propagation/TraceResponse	`100.00% <ø> (ø)`
ResourceDetectors/Container	`93.02% <ø> (ø)`
Sampler/RuleBased	`32.14% <ø> (ø)`
Shims/OpenTracing	`92.99% <ø> (ø)`
Symfony	`?`

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
...mentation/Doctrine/src/DoctrineInstrumentation.php	`96.39% <96.39%> (ø)`

... and 33 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c06f8c6...32d028b. Read the comment docs.

brettmc · 2024-10-03T07:42:35Z

@DominicDetta one more thing to add is an entry to the top-level .gitsplit.yaml. When this is merged, that'll be used to split the package into a read-only repository which we can point packagist at.
You'll also need to add an entry to .github/workflows/php.yml so that the tests will run.

DominicDetta · 2024-10-03T08:23:23Z

@brettmc I included the Doctrine directory in the files you specified

DominicDetta · 2024-10-03T08:41:25Z

I agreed the EasyCLA and waiting for the merge approval

brettmc · 2024-10-04T02:10:34Z

Can you also update workflows/php to exclude 7.4 from the version matrix? The php version requirement is ^8.2, but I think it would work back to 8.0 since it doesn't hook internal functions. Either way, we need to either drop the requirement back to 8.0 or exclude pre-8.2 versions from the test matrix.

DominicDetta · 2024-10-04T11:17:51Z

it's done

DominicDetta · 2024-10-04T12:00:18Z

In the end I integrated again the PHP 7.4 version and excluded Doctrine from the pre-8.2 versions tests

brettmc · 2024-10-04T13:22:02Z

Green build is an excellent start. I'm happy to push this out as a 0.0.1 release. Since this works against doctrine 3 (according to the composer.json), and doesn't hook any internal/extension functions, I think it would probably work in 8.0 and 8.1 as well. But, that's not a blocker to getting this out if you're happy with the PR as it is.

DominicDetta · 2024-10-04T13:33:04Z

I'm pretty confident it works even with PHP 8.0 version.
I can confirm that the classes \Doctrine\DBAL\Driver\Connection and \Doctrine\DBAL\Driver exist even in the next major versions of doctrine. (I'm using doctrine 3 at the moment). I'm glad I was able to contribute.

Nevay

Hooking driver methods will cause duplicated spans if a middleware like Symfony Debug Middleware is used.
An alternative approach to avoid this problem would be to move the implementation into a middleware and use a hook to inject this middleware into every connection; see also Nevay/otel-instrumentation-doctrine-dbal.

src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

Nevay · 2024-10-04T15:42:31Z

src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

+                $builder
+                    ->setAttribute(TraceAttributes::SERVER_ADDRESS, $params[0]['host'] ?? 'unknown')
+                    ->setAttribute(TraceAttributes::SERVER_PORT, $params[0]['port'] ?? 'unknown')
+                    ->setAttribute(TraceAttributes::DB_SYSTEM, $params[0]['driver'] ?? 'unknown')


IMO we should try to follow semconv even if we don't specify a schema url; see semconv:

db.system has the following list of well-known values. If one of them applies, then the respective value MUST be used

Ok, I didnt know there was a convention. I will apply the correct ones.

src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

Co-authored-by: Tobias Bachert <git@b-privat.de>

DominicDetta · 2024-10-07T14:31:58Z

@brettmc @Nevay

Hooking driver methods will cause duplicated spans if a middleware like Symfony Debug Middleware is used. An alternative approach to avoid this problem would be to move the implementation into a middleware and use a hook to inject this middleware into every connection; see also Nevay/otel-instrumentation-doctrine-dbal.

The duplicate problem is also a common problem with other auto-instrumentations. In fact, we could not use psr-15 as it generated too many spans. I wonder if I really need to fix this problem or is there a way to filter the results.
Does OpenTelemetry have a feature to configure and filter duplicated spans?

dkarlovi · 2024-10-09T08:39:57Z

Does OpenTelemetry have a feature to configure and filter duplicated spans?

AFAIK the idea is to allow the agent to do that:
https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/main/processor/filterprocessor/README.md

src/Instrumentation/Doctrine/_register.php

src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

cedricziel

My proposal is to figure out if this instrumentation should depend on the existence of PDO and then define how the two interact. My understanding is that if a PDO is used and the PDO instrumentation is active, there will be 2 nested CLIENT spans for the same interaction. - IMHO this is severely impacting the user experience.

cedricziel · 2024-10-16T06:26:13Z

src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

+            pre: static function (\Doctrine\DBAL\Driver $driver, array $params, string $class, string $function, ?string $filename, ?int $lineno) use ($instrumentation) {
+                /** @psalm-suppress ArgumentTypeCoercion */
+                $builder = self::makeBuilder($instrumentation, 'Doctrine\DBAL\Driver::connect', $function, $class, $filename, $lineno)
+                    ->setSpanKind(SpanKind::KIND_CLIENT);


How do we think about the duality of PDO and dbal here? Most instrumented methods have a pendant in the PDO instrumentation i.e. - a compatible pdo driver will cause two client spans and the hierarchy will probably be

my_func (internal) -> connect (client) # from dbal -> PDO::__construct (client ) # from pdo

IMO the two should be either mutually exclusive or enrich each other. WDYT?

We do not have a way for two auto-instrumentations to cooperate like this (it's an idea that has come up before, and would be cool). So I think mutually exclusive is the best we have - or at least it's up to the user to install only one instrumentation, I don't think we need to go as far as setting them up as conflicting in composer.
Is documenting that recommendation good enough for now?

edit: a hacky approach would be to have this package check if PDO instrumentation is installed and enabled, and only apply some of its hooks (if it provides any value that PDO itself doesn't)...?

Could the PDO instrumentation re-write the doctrine span from CLIENT to internal or bail if it recognizes the parent is already client?

To your point - i think documenting it makes the experience worse because users then need to make a whole lot of decisions when instrumenting

i think documenting it makes the experience worse

I only meant documenting such as "use pdo or doctrine auto-instrumentation, but probably not both" - I expect users to make decisions about which auto-instrumentations to install based on their workload/stack...just installing everything available is probably too noisy.

Could the PDO instrumentation re-write the doctrine span from CLIENT to internal or bail if it recognizes the parent is already client?

I think the issue here would be that pre/post hooks operate in isolation, so there's currently no way for a post hook to know whether the pre hook created a span or just modified the active span. All the implementations we have assume that pre created and activated a span, and post just closes whichever is the active span at the time.
We have previously brainstormed whether we could manage some state between a pre and post hook, but we thought it was going to be a hard problem.

I expect users to make decisions about which auto-instrumentations to install based on their workload/stack...just installing everything available is probably too noisy.

I completely agree on this matter.

Could the PDO instrumentation ... bail if it recognizes the parent is already client?

Bailing out requires storing additional data within the Context to be able to detect duplicate instrumentations, see Javas span suppression strategies:
Instrumentation span suppression behavior
SpanSuppressionStrategy.java
SpanSuppressors.java

AFAIK the idea is to allow the agent to do that

Dropping at the agent is too late as it may lead to broken traces "Dropping a span may lead to orphaned spans if the dropped span is a parent." ref.

there's currently no way for a post hook to know whether the pre hook created a span or just modified the active span

Scopes attached via Context::storage()->attach() implement ArrayAccess to allow propagating state from pre hook to post hook ref.

hook( null, 'example', static function() use ($tracer): void { $context = Context::getCurrent(); if (lcg_value() > .5 /* some suppression condition */) { $span = $tracer ->spanBuilder('example') ->startSpan(); $context = $span->storeInContext($context); } $scope = Context::storage()->attach($context); $scope[SpanInterface::class] = $span ?? null; }, static function(): void { if (!$scope = Context::storage()->scope()) { return; } $scope->detach(); /** @var SpanInterface|null $span */ $span = $scope[SpanInterface::class] ?? null; $span?->end(); }, );

cedricziel · 2024-10-16T06:29:11Z

src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

+            'query',
+            pre: static function (\Doctrine\DBAL\Driver\Connection $connection, array $params, string $class, string $function, ?string $filename, ?int $lineno) use ($instrumentation) {
+                /** @psalm-suppress ArgumentTypeCoercion */
+                $builder = self::makeBuilder($instrumentation, 'Doctrine\DBAL\Driver\Connection::query', $function, $class, $filename, $lineno)


The recommended span name here would be something like SELECT my_table.

The duplication with the pdo instrumentation is the same here.

https://opentelemetry.io/docs/specs/semconv/database/database-spans/#name

Ok. Do you know a good agnostic SQL parser to achieve this? The one I found is focusing on Mysql syntax.

Doctrine already knows the table name(s).

Doctrine already knows the table name(s).

With the approach I'm following right now, I dont have this information. Are you referring to Doctrine ORM? I'm creating hooks on the methods of \Doctrine\DBAL\Driver and \Doctrine\DBAL\Driver\Connection

I see, that's a good point.

The issue with this approach is whichever parser you choose is bound to be quite slow and will probably be slowest part of the call (which includes the actual query), making it unusable for production use. ORM itself has a DQL query parser (which is close enough for high level comparison) and it's basically unusable in prod without it being cached with ORM query cache.

Running a random query parser on every DB query is unlikely to be viable.

I think I will use a simple regex expression to identify the db.operation.name and target from the query text

IMO we could consider the ORM instrumentation even if it's still not here yet.

For example, since ORM will know what the table names are, you could assume they will be put in context by the ORM instrumentation and then only try to detect it if not there? This would future proof this implementation and allow extensions when they happen.

Sorry for the late response but I'm pretty busy at the moment.

IMO we could consider the ORM instrumentation even if it's still not here yet.

For example, since ORM will know what the table names are, you could assume they will be put in context by the ORM instrumentation and then only try to detect it if not there? This would future proof this implementation and allow extensions when they happen.

I see your point but my focus is to implement hooks for Doctrine DBAL and in the future time permitted we could refactor the functions to behave as you suggested.

feat(#1393): implement integration test and suppress phpstan and psal…

5feb61d

…m errors

DominicDetta requested a review from a team as a code owner October 3, 2024 07:05

DominicDetta mentioned this pull request Oct 3, 2024

[opentelemetry-php-contrib] Add support for Doctrine library open-telemetry/opentelemetry-php#1393

Open

feat(#1393): add split readonly repo and github workflows

42e52c4

ci: drop workflows PHP 7.4 version

2e6c142

ci: exclude pre-8.2 versions from github workflows

011365e

Nevay reviewed Oct 4, 2024

View reviewed changes

DominicDetta and others added 2 commits October 7, 2024 15:48

Update src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

046e810

Co-authored-by: Tobias Bachert <git@b-privat.de>

Update src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php

32d028b

Co-authored-by: Tobias Bachert <git@b-privat.de>

ostrolucky reviewed Oct 11, 2024

View reviewed changes

src/Instrumentation/Doctrine/_register.php Show resolved Hide resolved

src/Instrumentation/Doctrine/src/DoctrineInstrumentation.php Show resolved Hide resolved

cedricziel suggested changes Oct 16, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Doctrine auto-instrumentation #300

Add support for Doctrine auto-instrumentation #300

DominicDetta commented Oct 3, 2024

linux-foundation-easycla bot commented Oct 3, 2024 •

edited

Loading

codecov bot commented Oct 3, 2024 •

edited

Loading

brettmc commented Oct 3, 2024

DominicDetta commented Oct 3, 2024

DominicDetta commented Oct 3, 2024

brettmc commented Oct 4, 2024

DominicDetta commented Oct 4, 2024

DominicDetta commented Oct 4, 2024

brettmc commented Oct 4, 2024

DominicDetta commented Oct 4, 2024

Nevay left a comment

Nevay Oct 4, 2024

DominicDetta Oct 7, 2024 •

edited

Loading

DominicDetta commented Oct 7, 2024 •

edited

Loading

dkarlovi commented Oct 9, 2024

cedricziel left a comment

cedricziel Oct 16, 2024

brettmc Oct 16, 2024 •

edited

Loading

cedricziel Oct 16, 2024

cedricziel Oct 16, 2024

brettmc Oct 16, 2024

DominicDetta Oct 16, 2024

Nevay Oct 16, 2024

cedricziel Oct 16, 2024

DominicDetta Oct 16, 2024

dkarlovi Oct 16, 2024

DominicDetta Oct 17, 2024

dkarlovi Oct 17, 2024

DominicDetta Oct 17, 2024

dkarlovi Oct 23, 2024

DominicDetta Oct 23, 2024

Add support for Doctrine auto-instrumentation #300

Are you sure you want to change the base?

Add support for Doctrine auto-instrumentation #300

Conversation

DominicDetta commented Oct 3, 2024

linux-foundation-easycla bot commented Oct 3, 2024 • edited Loading

codecov bot commented Oct 3, 2024 • edited Loading

Codecov Report

brettmc commented Oct 3, 2024

DominicDetta commented Oct 3, 2024

DominicDetta commented Oct 3, 2024

brettmc commented Oct 4, 2024

DominicDetta commented Oct 4, 2024

DominicDetta commented Oct 4, 2024

brettmc commented Oct 4, 2024

DominicDetta commented Oct 4, 2024

Nevay left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DominicDetta Oct 7, 2024 • edited Loading

Choose a reason for hiding this comment

DominicDetta commented Oct 7, 2024 • edited Loading

dkarlovi commented Oct 9, 2024

cedricziel left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

brettmc Oct 16, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

linux-foundation-easycla bot commented Oct 3, 2024 •

edited

Loading

codecov bot commented Oct 3, 2024 •

edited

Loading

DominicDetta Oct 7, 2024 •

edited

Loading

DominicDetta commented Oct 7, 2024 •

edited

Loading

brettmc Oct 16, 2024 •

edited

Loading