-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mixin: Add Thanos Ruler alerts #1963
Conversation
Signed-off-by: Kemal Akkoyun <kakkoyun@gmail.com>
Signed-off-by: Kemal Akkoyun <kakkoyun@gmail.com>
Signed-off-by: Kemal Akkoyun <kakkoyun@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice work!
When creating/changing dashboards it would be nice to post some screenshots of that. 😊
sum by (job, pod, rule_group) (prometheus_rule_group_last_duration_seconds{%(selector)s}) | ||
> | ||
sum by (job, pod, rule_group) (prometheus_rule_group_interval_seconds{%(selector)s}) | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not too sure about this rule, as it's always evaluation for us, even though everything seems fine.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This still indicates a problem though. Its severity set to warning
. We might also give a coefficient for the interval but I think we should have this.
With this alert, you can detect you have too many rules for a group for example. Or your queries take too much time.
It's not very critical but I still see value in this.
Signed-off-by: Kemal Akkoyun <kakkoyun@gmail.com>
@metalmatze ping |
@metalmatze Could you merge? Or should we wait for another opinion? |
This PR adds missing Ruler alerts to thanos-mixin.
Alerts are inspired by https://thanos.io/components/rule.md/#must-have-essential-ruler-alerts
Signed-off-by: Kemal Akkoyun kakkoyun@gmail.com
Changes
Verification
For rules:
make example-rules-lint
For dashboards: