Add metrics to the catalog server #156

everettraven · 2023-08-31T14:40:32Z

Description

Adds metrics to the catalog server for calculating the Apdex score.
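For readers unfamiliar with the score, here is a minimal sketch of the commonly used Apdex formula, (satisfied + tolerating/2) / total, applied to cumulative histogram counts. It assumes the histogram has bucket bounds at exactly T and 4T. The function name `apdex`, its parameters, and the sample counts are hypothetical and are not taken from this PR.

```go
package main

import "fmt"

// apdex computes (satisfied + tolerating/2) / total from cumulative
// histogram counts (hypothetical helper, not part of this PR):
//
//	leT   = requests that completed within T seconds
//	le4T  = requests that completed within 4T seconds
//	total = all observed requests
func apdex(leT, le4T, total uint64) float64 {
	if total == 0 {
		return 0 // no traffic; callers may prefer to treat this as "unknown"
	}
	if le4T < leT {
		le4T = leT // cumulative counts should never decrease; guard against underflow
	}
	satisfied := float64(leT)
	tolerating := float64(le4T - leT)
	return (satisfied + tolerating/2) / float64(total)
}

func main() {
	// Made-up counts for T = 1s: 80 requests <= 1s, 95 requests <= 4s, 100 total.
	fmt.Printf("%.3f\n", apdex(80, 95, 100)) // prints 0.875
}
```

Requests slower than 4T count as frustrated and contribute nothing to the numerator, which is why only the T and 4T counts plus the total are needed.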

Motivation

resolves Add metrics to the Storage implementation #127

Note
This PR is based on #148 and should only be merged after it. For that reason, this PR will remain a draft until #148 has merged.

codecov · 2023-08-31T14:43:21Z

Codecov Report

Patch and project coverage have no change.

Comparison is base (966a4d6) 79.06% compared to head (484acfd) 79.06%.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #156   +/-   ##
=======================================
  Coverage   79.06%   79.06%           
=======================================
  Files           3        3           
  Lines         215      215           
=======================================
  Hits          170      170           
  Misses         28       28           
  Partials       17       17

pkg/server/metrics.go

that can be used for calculating the Apdex Score and assess the health of the http server that is serving catalog contents to clients

Signed-off-by: Bryce Palmer <bpalmer@redhat.com>

cmd/manager/main.go

Signed-off-by: Bryce Palmer <bpalmer@redhat.com>

pkg/server/metrics.go

Signed-off-by: Bryce Palmer <bpalmer@redhat.com>

joelanford · 2023-09-08T17:03:56Z

pkg/metrics/metrics.go

+			// calculate Apdex Scores up to a T of 1 second, but using various mathematical formulas we
+			// should be able to estimate Apdex Scores up to a T of 2.5. Having a larger range of buckets
+			// will allow us to more easily calculate health indicators other than the Apdex Score.
+			Buckets: []float64{0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 1.2, 1.6, 2, 2.4, 2.8, 3.2, 3.6, 4, 10},


Does it matter that our write timeout is 10s and the max bucket duration is 10s?

Seems like we'll only ever get whatever error code maps to that timeout in the 10s bucket.

everettraven · 2023-09-08

I don't think so. If anything, it means we now have buckets that capture all possible response times, which lets us calculate more metrics on the fly. This is all based on #156 (comment): if no request takes more than 10s, we will never have anything in the "Inf" bucket.

That said, I could be wrong. I don't have enough experience in this area to know for sure, and I'm working from an assumption based on what I currently know.

everettraven · 2023-09-08

> Seems like we'll only ever get whatever error code maps to that timeout in the 10s bucket.

Any response time > 4s and <= 10s will fall in that 10s bucket.

joelanford · 2023-09-08

Ah, got it. sgtm.

The suggested change was accepted. Huzzah!

openshift-ci bot added the do-not-merge/work-in-progress label Aug 31, 2023

stevekuznetsov reviewed Sep 1, 2023

pkg/server/metrics.go (outdated, resolved)

stevekuznetsov approved these changes Sep 5, 2023

everettraven force-pushed the feature/server-metrics branch from dbb9540 to efa5262 on September 8, 2023 16:45

add metrics to catalogd http server

1c56928

that can be used for calculating the Apdex Score and assess the health of the http server that is serving catalog contents to clients

Signed-off-by: Bryce Palmer <bpalmer@redhat.com>

everettraven force-pushed the feature/server-metrics branch from 7fcb6ea to 1c56928 on September 8, 2023 16:48

everettraven changed the title ~~WIP: Add metrics to the catalog server~~ Add metrics to the catalog server Sep 8, 2023

everettraven marked this pull request as ready for review September 8, 2023 16:49

everettraven requested a review from a team as a code owner September 8, 2023 16:49

openshift-ci bot removed the do-not-merge/work-in-progress label Sep 8, 2023

ncdc reviewed Sep 8, 2023

cmd/manager/main.go (resolved)

ncdc reviewed Sep 8, 2023

cmd/manager/main.go (outdated, resolved)

quick fixes from review comments

87b10f2

Signed-off-by: Bryce Palmer <bpalmer@redhat.com>

anik120 previously requested changes Sep 8, 2023

pkg/server/metrics.go (outdated, resolved)

everettraven added 2 commits September 8, 2023 12:59

rename package from server --> metrics

eb9e187

Signed-off-by: Bryce Palmer <bpalmer@redhat.com>

rename package from server --> metrics

484acfd

Signed-off-by: Bryce Palmer <bpalmer@redhat.com>

joelanford reviewed Sep 8, 2023

joelanford approved these changes Sep 8, 2023

everettraven added this pull request to the merge queue Sep 8, 2023

Merged via the queue into operator-framework:main with commit a1663ec Sep 8, 2023
10 checks passed
