Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

agent should remember state of checks across restart #1001

Closed
blalor opened this issue Jun 4, 2015 · 6 comments
Closed

agent should remember state of checks across restart #1001

blalor opened this issue Jun 4, 2015 · 6 comments
Labels
type/bug Feature does not function as expected

Comments

@blalor
Copy link
Contributor

blalor commented Jun 4, 2015

When upgrading my consul agents from 0.5.0 to 0.5.2, I noticed that TTL checks that were previously passing are now showing as critical. This causes a massive disruption in the force our monitoring system, especially since we're exposing ~1000 services on some hosts. I understand the reasoning behind defaulting the state of a check to critical, but I think it'd be reasonable to have the agent use the last-known state for a check when restarting. For a TTL check, this may just be using the last-known state if the agent restarts before (a multiple of?) the TTL has expired.

@blalor
Copy link
Contributor Author

blalor commented Jun 4, 2015

This is especially important when a check may have a long TTL (like in the 10s of minutes).

@ryanuber ryanuber added the type/bug Feature does not function as expected label Jun 4, 2015
@ryanuber
Copy link
Member

ryanuber commented Jun 4, 2015

Hey @blalor, this sounds like it could be a regression. I'll crack into this one very soon. Thanks for reporting it!

@ryanuber
Copy link
Member

ryanuber commented Jun 5, 2015

@blalor I remembered wrong, we were never restoring the previous status up to this point. I've started a branch for this. I think it will need to be optional and off by default so as to not disrupt clusters which depend on the current functionality. I'll update here when I'm closer to having something usable.

@blalor
Copy link
Contributor Author

blalor commented Jun 5, 2015

That's great, thank you. I'm ok with opting-in, although I'm not sure that's the behavior I'd expect by default. :-)

@ryanuber
Copy link
Member

ryanuber commented Jun 8, 2015

@blalor the above merged PR should fix the TTL checks and restore their status on later agent start. No config option needed for this case since it is only the TTL checks that get this new behavior.

@blalor
Copy link
Contributor Author

blalor commented Jun 8, 2015

Excellent! Thanks!

duckhan pushed a commit to duckhan/consul that referenced this issue Oct 24, 2021
* Update CHANGELOG for Consul 1.10.0

Co-authored-by: Ashwin Venkatesh <ashwin@hashicorp.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type/bug Feature does not function as expected
Projects
None yet
Development

No branches or pull requests

2 participants