Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Controllers with inconsistent state can cause intermittent failures #2793

Open
stuartwdouglas opened this issue Sep 23, 2024 · 0 comments
Open
Labels
core The core of FTL

Comments

@stuartwdouglas
Copy link
Collaborator

This is a follow on issue for the PR #2790 that adds a delay before deleting runners.

Fundamentally the issue is about deciding when a deployment is 'ready', and when old deployments can be removed. There are other cases that we need to consider here as well, e.g. starting async calls before all controllers have their route tables updates can result in failure.

We need to remove the sleep based hack and come up with a proper solution around deployment readiness.

@github-actions github-actions bot added the triage Issue needs triaging label Sep 23, 2024
@ftl-robot ftl-robot mentioned this issue Sep 23, 2024
@alecthomas alecthomas added advanced Requires advanced understanding of FTL internals core The core of FTL and removed triage Issue needs triaging advanced Requires advanced understanding of FTL internals labels Sep 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core The core of FTL
Projects
None yet
Development

No branches or pull requests

2 participants