Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Monitoring] Re-evaluate how alertInstanceId is used in rules #109151

Open
chrisronline opened this issue Aug 18, 2021 · 0 comments
Open

[Monitoring] Re-evaluate how alertInstanceId is used in rules #109151

chrisronline opened this issue Aug 18, 2021 · 0 comments
Labels
bug Fixes for quality problems that affect the customer experience Team:Monitoring Stack Monitoring team

Comments

@chrisronline
Copy link
Contributor

When we first created Kibana rules in stack monitoring, we were under the assumption that we could create unique alertInstanceIds to maintain separate throttle periods for a single alert firing under separate circumstances (such as a disk usage alert firing on unique throttle periods based on the list of nodes affected). This does not work well with the fact that alertInstanceIds that do not schedule actions are forgotten.

Recent work changed how these work and actually helped address this by ensuring we always create the same alertInstanceIds each time the rule runes, but it looks like we might have issues if we aren't always scheduling actions for these (which looks to be the case).

I'm opening this issue because this wasn't understood when these rules were first created and folks on the @elastic/stack-monitoring team might want to reconsider how these rules are designed as a result. Feel free to close if this is now understood and handled appropriately.

@chrisronline chrisronline added bug Fixes for quality problems that affect the customer experience Team:Monitoring Stack Monitoring team labels Aug 18, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Fixes for quality problems that affect the customer experience Team:Monitoring Stack Monitoring team
Projects
None yet
Development

No branches or pull requests

1 participant