-
Notifications
You must be signed in to change notification settings - Fork 618
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WARN messages when no Tasks are scheduled #506
Comments
@miketheman We just released 1.12.1 which should have addressed a number of problems related to this. Were you seeing this with 1.12.1 or with a previous version? |
Hi @samuelkarp ! Indeed, this was observed during bringing up new instances with the latest Agent version. |
@miketheman Thanks for confirming. Can you share the logs you're seeing? I tried (trivially) to reproduce this with a new instance running our 2016.03.h AMI and I'm not seeing any of those WARNs. If you're not comfortable sharing publicly, can you send them to me at skarp (at) amazon.com? |
@samuelkarp Here's an example:
Instance details:
|
@miketheman This looks like maybe we didn't fully fix #478. Can you provide the following information? Some of this might be more sensitive, so you can either send it to me by email at skarp (at) amazon.com or open a case with AWS Support. If there is anything like credentials or auth tokens in the logs (from environment variables or command/entrypoint), please redact them. All of this should come from an instance that is currently affected:
I'd like to correlate the container that shows up in the logs ( |
@samuelkarp I have sent the logs to your amazon email address. |
@miketheman Thank you for sending all that information! I think I've narrowed this down to occurring when the agent is disconnected from and reconnects to the websocket it uses for reporting metrics. On a reconnect, it appears that the very first time it attempts to send metric data it emits this warning. I've now been able to reproduce this behavior myself, so we should be able to take it from here. Thank you for reporting this issue! |
Still reproduces on 1.13.1 |
@EugeneAbramchuk Thanks. We haven't fixed this yet since the only problem here is just a spurious WARN message. We'll keep this issue updated as it gets fixed, or if you're looking for something to contribute this would be something we'd accept. To restate what we think is going on a bit more clearly:
There is no impact other than it being an annoying WARN message. However, there is a case where |
@samuelkarp What's the ETA on this fix? |
During routine cluster management, we tend to bring up extra capacity in our cluster to be ready to accept new scheduled tasks on these instances.
We routinely see a behavior where the instance isn't running any scheduled tasks, yet emits WARN logs that look like this:
Now, I realize this may have something to do with the detection of other containers running on the instance. We run a per-container-instance Agent for Task containers to communicate with via host networking, similar to the approach described in the AWS Blog post.
Is the ECS Agent detecting the other running container, making the instance not
idle
and then failing to collect task related metrics, since there are no ECS-managed tasks?ref links:
amazon-ecs-agent/agent/stats/engine.go
Lines 229 to 231 in d5e8c51
amazon-ecs-agent/agent/stats/engine.go
Lines 183 to 188 in d5e8c51
The text was updated successfully, but these errors were encountered: