-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unclean exit of bootstrap.sh on Neuron instances #1826
Comments
The bootstrap script shouldn't exit with a non-zero code in this way -- we need to move this |
Hey, thanks for the prompt response! |
Hi, when can I expect this fix to be reflected in the actual systems? I was actually kinda blocked on it as the workaround is not applicable for my project. |
This will land in an AMI build next week. 👍 |
Thanks! |
This has been resolved 👍 |
Hi,
So I was trying to create an instance group with
inf2.xlarge
instance type in an eks cluster. According to the AWS docs: here & AWS Neuron Docs: here, the EKS optimized accelarated AMIs should support it. I tried creating this using/aws/service/eks/optimized-ami/1.28/amazon-linux-2-gpu/recommended/image_id
as the ssm parameter for the AMI.But the creation of the instance group failed with the following error message in cloudformation stacks:
Received 1 FAILURE signal(s) out of 1. Unable to satisfy 100% MinSuccessfulInstancesPercent requirement
.To troubleshoot it a bit further, I SSHed into the ec2 instance and found the following errors in the
cloud-init.log
and thecloud-init-output.log
:cloud-init.log
:cloud-init-output.log
:And the line 649 where it fails in the
/etc/eks/bootstrap.sh
is:amazon-eks-ami/templates/al2/runtime/bootstrap.sh
Line 649 in 0fdc793
Please let me know how I can resolve this.
The text was updated successfully, but these errors were encountered: