Query language transform processor #6985

anuraaga · 2022-01-03T05:51:52Z

Description:
This implements an initial version of a telemetry transform processor that supports arbitrary processing queries. The initial version scopes to

Traces only for now, though the design should work for non-traces
Simple transformation on instants of data without temporal transformations
Only two functions, set and keep for now. This PR is for the framework, and after that can add many functions in followups

For this code I have interfaces like traces.getter instead of more generic getter to apply to multiple signals. The types are mostly passthrough between different parts and I think replacing Span with interface{} would mostly work to simplify things. I'd like to try that when adding a new signal since when initially trying to keep things simple I was losing sanity due to dropping the compile time safety. Or even better could be to leave such a cleanup for Go 1.18.

Link to tracking Issue: open-telemetry/opentelemetry-collector#4444

Testing: Unit tests

Documentation: README

@bogdandrutu @punya

Benchmark results

goos: linuxgoarch: amd64
pkg: github.com/open-telemetry/opentelemetry-collector-contrib/processor/transformprocessor/internal/traces
cpu: Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
BenchmarkTwoSpans
BenchmarkTwoSpans/no_processing
BenchmarkTwoSpans/no_processing-16         	  689240	      1558 ns/op
BenchmarkTwoSpans/set_attribute
BenchmarkTwoSpans/set_attribute-16         	  696786	      1760 ns/op
BenchmarkTwoSpans/keep_attribute
BenchmarkTwoSpans/keep_attribute-16        	  631849	      1882 ns/op
BenchmarkTwoSpans/no_match
BenchmarkTwoSpans/no_match-16              	  611616	      1647 ns/op
BenchmarkTwoSpans/inner_field
BenchmarkTwoSpans/inner_field-16           	  708600	      1702 ns/op
BenchmarkTwoSpans/inner_field_both_spans
BenchmarkTwoSpans/inner_field_both_spans-16         	  561469	      1826 ns/op
BenchmarkHundredSpans
BenchmarkHundredSpans/no_processing
BenchmarkHundredSpans/no_processing-16              	   25196	     47882 ns/op
BenchmarkHundredSpans/set_status_code
BenchmarkHundredSpans/set_status_code-16            	   20630	     58651 ns/op
BenchmarkHundredSpans/hundred_statements
BenchmarkHundredSpans/hundred_statements-16         	    2444	    485470 ns/op
PASS

anuraaga · 2022-01-03T05:52:26Z

processor/transformprocessor/internal/common/parser.go

+package common // import "github.com/open-telemetry/opentelemetry-collector-contrib/processor/transformprocessor/internal/common"
+
+import (
+	"github.com/alecthomas/participle/v2"


Happy to replace with something like goyacc if this library causes any concerns, I appreciated how simple it was to do the parsing with it

jpkrohling · 2022-01-03T10:32:28Z

@anuraaga, will you split this PR to make this easier to review? I'm also interested in hearing more about the relationship between this and the attributes processor (and potentially other similar processors).

bogdandrutu · 2022-01-03T18:51:48Z

I'm also interested in hearing more about the relationship between this and the attributes processor (and potentially other similar processors).

We decided long time ago in multiple SIG meetings that we will replace the current Attribute/Span/MetricsTransform/etc. with a more consistent "transform" (name was not decided) processor that uses a much simpler and consistent language across signals.