-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Tracking] AWS Cloudwatch metrics reporting issue with CGroupsV2 #585
Comments
After quickly checked github.com/containerd/cgroups v0.0.0-20170627184340-c3fc2b77b568 Could you open an issue on the repository to get a confirmation from AWS folks?
Since |
We reached out to AWS support on this issue and they had confirmed that ecs-agent dont support cgroups v2 at this time and they have an open issue. We dont have a ETA around this still. |
@karthikeyanvenkatraman thanks again for your report, two documentations points have been added:
Let's keep this issue opened has a tracking one for future generations. |
I see this is resolved now with the release of ecs agent v1.61.0 this morning. |
@karthikeyanvenkatraman this is great - it's a really good news. Did you get a chance to test it ? |
@tormath1 , yes. we did roll out the changes and did not notice any issues until now. |
I'm closing this issue as resolved. Free free to reopen if you need to discuss more. |
Description
We are using faltcar AWS AMI's for ECS cluster and since the release of flatcar stable version 2983.2.0, We are no longer able to get ECS Cloudwatch Metrics. Seems like ecs-agent is not compatible with Cgroups V2
Impact
On upgrading the instances to flatcar AMI ami-08165d837cc8ef7f6, AWS ECS Cluster and Service metrics such as CPU Utilization and Memory Utilization are unavailable.
Environment and steps to reproduce
Expected behavior
The ECS instances are supposed to send metrics such as CPU Utilization and Memory Utilization to cloudwatch. However we get the below messages from the ecs-agent.
msg="cloudwatch metrics for container 7d8386b039d1a0726d863a74412824d39a97f984258ed315a60d58cb90cc5fbf not collected, reason (cpu): need at least 2 data points in queue to calculate CW stats set" module=engine.go
Additional information
We could manually force the AMI to use the legacy cgroups using the doc - https://www.flatcar-linux.org/docs/latest/container-runtimes/switching-to-unified-cgroups/#starting-new-nodes-with-legacy-cgroups, However this requires a restart and looks to be a time consuming factor. We would like to know if there is any alternate fix available without restart and still be able to use the new features that comes up as part of the stable latest releases.
The text was updated successfully, but these errors were encountered: