-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
agent should remember state of checks across restart #1001
Comments
This is especially important when a check may have a long TTL (like in the 10s of minutes). |
Hey @blalor, this sounds like it could be a regression. I'll crack into this one very soon. Thanks for reporting it! |
@blalor I remembered wrong, we were never restoring the previous status up to this point. I've started a branch for this. I think it will need to be optional and off by default so as to not disrupt clusters which depend on the current functionality. I'll update here when I'm closer to having something usable. |
That's great, thank you. I'm ok with opting-in, although I'm not sure that's the behavior I'd expect by default. :-) |
@blalor the above merged PR should fix the TTL checks and restore their status on later agent start. No config option needed for this case since it is only the TTL checks that get this new behavior. |
Excellent! Thanks! |
* Update CHANGELOG for Consul 1.10.0 Co-authored-by: Ashwin Venkatesh <ashwin@hashicorp.com>
When upgrading my consul agents from 0.5.0 to 0.5.2, I noticed that TTL checks that were previously passing are now showing as
critical
. This causes a massive disruption inthe forceour monitoring system, especially since we're exposing ~1000 services on some hosts. I understand the reasoning behind defaulting the state of a check to critical, but I think it'd be reasonable to have the agent use the last-known state for a check when restarting. For a TTL check, this may just be using the last-known state if the agent restarts before (a multiple of?) the TTL has expired.The text was updated successfully, but these errors were encountered: