Skip to content

Commit

Permalink
Merge pull request 'fixed the alert expression for node which are not…
Browse files Browse the repository at this point in the history
… sending metrics within the timefram' (#370) from fix_opsmondo_alert into master

Reviewed-on: https://gitea.obmondo.com/EnableIT/KubeAid/pulls/370
  • Loading branch information
Klavs Klavsen committed Aug 23, 2024
2 parents f8f5399 + e40cfb9 commit 0cff28f
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions argocd-helm-charts/prometheus-linuxaid/rules/prometheus.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,9 @@ groups:
rules:
- alert: monitor::metrics::missing::pushprox
expr: |
(up{job="pushprox"} == 0) * on (certname) group_left(job) (node_time_seconds - node_boot_time_seconds > 600)
and on(certname) obmondo_monitoring{alert_id="monitor::metrics::missing::pushprox"} > 0
(100 * (count(up == 0) BY (certname, job) / count(up) BY (certname, job)) > 10)
* on (certname) group_right(job) (node_time_seconds - node_boot_time_seconds > 600)
and on(certname) obmondo_monitoring{alert_id="monitor::prometheus::metrics"} > 0
for: 30m
labels:
severity: critical
Expand Down

0 comments on commit 0cff28f

Please sign in to comment.