-
Notifications
You must be signed in to change notification settings - Fork 174
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Monitor runners in case of insufficient resources #251
Comments
Some of the same considerations I mentioned about |
I believe we're encountering We have hard limits set for K8S namespaces (CPU/Memory/Max number of pods). If a test setup violates the aforementioned limits, it results in an infinite loop. For instance, if someone sets |
@freevatar thanks! Your case is a "perfect" example of this problem. One thing I'd like to clarify: since you pointed out the version, did you not encounter this problem in previous versions, like v0.0.10rc2, etc.? |
@na-- I've missed your comments 🤦 thank you! But yes, this particular case is more about "Kubernetes level" UX rather than k6. Either way, it is in my TODO plans to go through your distributed updates in k6 repo - I'll comment then 👍 |
Sorry for confusion, what I meant is that we tested the latest available version as well. |
Feature Description
When one of the runners does not have sufficient resources allocated for the test, it goes into OOM state (insufficient memory for VUs. There can be other types of error for the same case as well). This condition is not monitored by the operator in any way, resulting in infinite wait loop for the pods to bootstrap.
This case should be monitored by the operator, followed by abortion of the test.
Suggested Solution (optional)
By initial experiments, there are two loops that can become infinite in such cases, at
stage = "created"
andstage = "started"
.Note that test runs in different modes need to handled this case differently.
Already existing or connected issues / PRs (optional)
Potentially connected issue: #222
The text was updated successfully, but these errors were encountered: