Make Alert Manager API Concurrency configurable #5412

emanlodovice · 2023-06-18T23:18:08Z

What this PR does:
This PR makes the API Concurrency config value of alertmanager configurable via cortex alertmanager config.

Which issue(s) this PR fixes:
Fixes #

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

alvinlin123 · 2023-06-19T17:15:17Z

docs/configuration/config-file-reference.md

@@ -416,6 +416,10 @@ cluster:
 # CLI flag: -experimental.alertmanager.enable-api
 [enable_api: <boolean> | default = false]

+# Maximum number of concurrent GET API requests before returning 503.


Should it be a limit that we can override in tenant limit?

Yes I think it makes sense that we can override this limit per tenant so we can adjust per tenant use case.

I moved the configuration to tenant limit in the latest commit 🙏

Moved back to static config because we don't create a new am instance when loading runtime config so we are not able to update this value at runtime.

alvinlin123 · 2023-06-20T17:03:51Z

docs/configuration/config-file-reference.md

@@ -3074,6 +3074,11 @@ The `limits_config` configures default and per-tenant limits imposed by Cortex s
 # alerts will fail with a log message and metric increment. 0 = no limit.
 # CLI flag: -alertmanager.max-alerts-size-bytes
 [alertmanager_max_alerts_size_bytes: <int> | default = 0]
+
+# Maximum number of concurrent GET API requests before returning 503. If 0,


Ideally if we are specific about the 503, we should probably have a test around it in case prometheus/alertmanager changes the HTTP status code. This way we can catch the change and update this doc.

I see, do you think changing the wording from "returning 503" to something like "returning an error" would be enough here?

Yea, I think "returning an error" is better because I don't think prometheus/alertmanager specifies the behaviour when concurrency is reached (503 is not defined in the API model at least :-) )

alvinlin123

LGTM, thanks!

qinxx108 · 2023-06-20T17:52:36Z

pkg/alertmanager/alertmanager.go

@@ -266,6 +268,7 @@ func New(cfg *Config, reg *prometheus.Registry) (*Alertmanager, error) {
 		GroupFunc: func(f1 func(*dispatch.Route) bool, f2 func(*types.Alert, time.Time) bool) (dispatch.AlertGroups, map[model.Fingerprint][]string) {
 			return am.dispatcher.Groups(f1, f2)
 		},
+		Concurrency: apiConcurrency,


this won't respect the run-time configuration unless tenant re-created the am config

Thank you for this insight. We are moving this back as a static config.

Signed-off-by: Emmanuel Lodovice <lodovice@amazon.com>

docs/configuration/config-file-reference.md

pull-request-size bot added the size/S label Jun 18, 2023

emanlodovice force-pushed the expose-am-api-concurrency-config branch 2 times, most recently from a442430 to 35cbe38 Compare June 19, 2023 17:05

alvinlin123 reviewed Jun 19, 2023

View reviewed changes

emanlodovice force-pushed the expose-am-api-concurrency-config branch 2 times, most recently from 0e5d3ef to a19569b Compare June 20, 2023 01:22

emanlodovice requested a review from alvinlin123 June 20, 2023 01:39

emanlodovice force-pushed the expose-am-api-concurrency-config branch 2 times, most recently from 7f8ed47 to 394e166 Compare June 20, 2023 05:12

alvinlin123 reviewed Jun 20, 2023

View reviewed changes

alvinlin123 approved these changes Jun 20, 2023

View reviewed changes

alvinlin123 requested review from alvinlin123, friedrichg, alanprot and yeya24 June 20, 2023 17:06

qinxx108 reviewed Jun 20, 2023

View reviewed changes

alanprot approved these changes Jun 20, 2023

View reviewed changes

emanlodovice force-pushed the expose-am-api-concurrency-config branch 2 times, most recently from 76613d8 to 70c4b62 Compare June 20, 2023 19:04

Make Alert Manager API Concurrency configurable

13c81e4

Signed-off-by: Emmanuel Lodovice <lodovice@amazon.com>

emanlodovice force-pushed the expose-am-api-concurrency-config branch from 70c4b62 to 13c81e4 Compare June 20, 2023 20:33

emanlodovice requested a review from qinxx108 June 20, 2023 21:25

friedrichg approved these changes Jun 21, 2023

View reviewed changes

docs/configuration/config-file-reference.md Outdated Show resolved Hide resolved

alvinlin123 merged commit 2b551b6 into cortexproject:master Jun 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make Alert Manager API Concurrency configurable #5412

Make Alert Manager API Concurrency configurable #5412

emanlodovice commented Jun 18, 2023 •

edited

Loading

alvinlin123 Jun 19, 2023

emanlodovice Jun 19, 2023

emanlodovice Jun 20, 2023

emanlodovice Jun 20, 2023

alvinlin123 Jun 20, 2023

emanlodovice Jun 20, 2023

alvinlin123 Jun 20, 2023

alvinlin123 left a comment

qinxx108 Jun 20, 2023

emanlodovice Jun 20, 2023

Make Alert Manager API Concurrency configurable #5412

Make Alert Manager API Concurrency configurable #5412

Conversation

emanlodovice commented Jun 18, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alvinlin123 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

emanlodovice commented Jun 18, 2023 •

edited

Loading