topdown: Pass metrics to built-ins and record http.send latency #2629

tsandall · 2020-08-13T18:03:12Z

Previously the built-in functions had no way to record metrics for
performance monitoring or other purposes. With this change, built-in
functions can manipulate the evaluation metrics. Initially, only the
http.send function has been updated to report latency.

Fixes #2034

Signed-off-by: Torin Sandall torinsandall@gmail.com

patrick-east · 2020-08-13T19:00:06Z

topdown/http.go

+var (
+	allowedKeys              = ast.NewSet()
+	requiredKeys             = ast.NewSet(ast.StringTerm("method"), ast.StringTerm("url"))
+	httpSendLatencyMetricKey = "rego_" + ast.HTTPSend.Name


Would it make sense to define a shared builtin key prefix or something? Could maybe have a helper on the BuiltinContext or something.

🤔 That being said maybe we do it when we have >1, for a single metric it doesn't really matter too much. I just want to make sure we keep them uniform since they are (sort of) user facing names.

Changed to rego_builtin_http.send.

patrick-east · 2020-08-13T19:15:18Z

topdown/http.go

@@ -64,6 +67,8 @@ const httpSendBuiltinCacheKey httpSendKey = "HTTP_SEND_CACHE_KEY"

 func builtinHTTPSend(bctx BuiltinContext, args []*ast.Term, iter func(*ast.Term) error) error {

+	bctx.Metrics.Timer(httpSendLatencyMetricKey).Start()


Any thoughts around having a metric for cache hits too? As-is on subsequent calls the time would still be accumulated in here too, but it might be hard for someone to tell how much time the actual http call took versus http call plus some (potentially unknown?) number of cache hits that are accumulated in this metric.

I wonder too if we would want a mechanism to allow for providing more details about what the metric is, like.. if there are two http.send calls in some policy for different URLs or something it may not make sense to add their times together under a single timer. The tricky thing is that we dont always want to shove like the full call context into the name of the metric.. but having some way to map it back to it might be nice. Not really sure I have a great suggestion though. Brainstorming a little bit.. maybe we have an ID or something to map the metric to trace node? I guess maybe it would require having some like metadata or something to tag on the metric.

Again though.. maybe we cross that bridge when we get there.

Any thoughts around having a metric for cache hits too?

I was assuming that cache hit latency would be negligible compared to cache miss latency. From my point of view, the current implementation will be useful for telling when the query is CPU vs. IO bound. That's one of the main issues I've encountered with policies that use http.send--when you run into performance problems, you want to divide-and-conquer and quickly identifying whether it was an http.send seems useful. I don't think this is going to useful for profiling individual http.send callls. LMKWYT.

Works for me 👍

Previously the built-in functions had no way to record metrics for performance monitoring or other purposes. With this change, built-in functions can manipulate the evaluation metrics. Initially, only the http.send function has been updated to report latency. Fixes open-policy-agent#2034 Signed-off-by: Torin Sandall <torinsandall@gmail.com>

tsandall requested a review from ashutosh-narkar August 13, 2020 18:03

patrick-east previously approved these changes Aug 13, 2020

View reviewed changes

ashutosh-narkar previously approved these changes Aug 13, 2020

View reviewed changes

tsandall dismissed stale reviews from ashutosh-narkar and patrick-east via 1c90afa August 13, 2020 20:28

tsandall force-pushed the master branch from 0046165 to 1c90afa Compare August 13, 2020 20:28

patrick-east approved these changes Aug 13, 2020

View reviewed changes

tsandall merged commit 6507fa4 into open-policy-agent:master Aug 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

topdown: Pass metrics to built-ins and record http.send latency #2629

topdown: Pass metrics to built-ins and record http.send latency #2629

tsandall commented Aug 13, 2020

patrick-east Aug 13, 2020

tsandall Aug 13, 2020

patrick-east Aug 13, 2020

tsandall Aug 13, 2020

patrick-east Aug 13, 2020

		@@ -64,6 +67,8 @@ const httpSendBuiltinCacheKey httpSendKey = "HTTP_SEND_CACHE_KEY"

		func builtinHTTPSend(bctx BuiltinContext, args []ast.Term, iter func(ast.Term) error) error {

		bctx.Metrics.Timer(httpSendLatencyMetricKey).Start()

topdown: Pass metrics to built-ins and record http.send latency #2629

topdown: Pass metrics to built-ins and record http.send latency #2629

Conversation

tsandall commented Aug 13, 2020

patrick-east Aug 13, 2020

Choose a reason for hiding this comment

tsandall Aug 13, 2020

Choose a reason for hiding this comment

patrick-east Aug 13, 2020

Choose a reason for hiding this comment

tsandall Aug 13, 2020

Choose a reason for hiding this comment

patrick-east Aug 13, 2020

Choose a reason for hiding this comment