Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Managed Node Group Update Behaviour is missing a possible cause for NodeCreationFailure #842

Open
leefinfor opened this issue Jan 20, 2025 · 1 comment

Comments

@leefinfor
Copy link

leefinfor commented Jan 20, 2025

The Scale Up phase can fail with 'NodeCreationFailure' and mentions 4 possible causes, however there is a 5th which is worthy of a mention.

Each node must bootstrap within 15 minutes

If any node takes more than 15-minutes to bootstrap and join the cluster it will cause the upgrade to time out. This is the total runtime for bootstrapping a new node measured from when a new node is required to when it joins the cluster.

It does currently mention UserData as a possible cause but in the case we were investigating UserData was not broken, The node did join the cluster and was healthy, it is just that a UserData script nudged node creation beyond the 15-minute window.

https://github.com/awsdocs/amazon-eks-user-guide/blob/mainline/latest/ug/nodes/managed-node-update-behavior.adoc

For reference: [Case 173581560000418] Windows EKS Nodegroup update is not working

@leefinfor
Copy link
Author

An alternative would be to extend the UserData root cause to include runtime but the 15-minute windows is important and as such should be a separate item.

FastLaunch has been proposed as a possible solution, other options included moving items around within the userdata. Specifically anything which do not require a reboot and will not interfere with the kubelet could be moved after the script which connects the mode to the cluster.

However improving UserData performance a bigger discussion and could bloat the document. A suitable link to an existing document could work though.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant