eks-prow-build-cluster: Use dedicated Managed Node Groups (MNGs) per Availability Zone (AZ) #6320

xmudrii · 2024-01-24T15:27:17Z

xref #6303 (see for more details and context)

As per recommendations received from the AWS support and @tzneal, we're replacing blue and green node groups with node groups per AZ. In other words, old node groups had instances in all three AZs. Now, we have a dedicated node group for each AZ. This is a short-term solution to fix the stability issues that we're facing.

This has been successfully rolled out to canary, I'll do prod rollout once this PR is reviewed and merged.

Notes:

We plan to switch to Karpenter long-term (eks-prow-build-cluster: Use Karpenter instead of cluster-autoscaler #5168)
We didn't want to suspend AZRebalacing process because it's not natively supported by EKS, so we opted in for dedicated node groups as that solution can be fully-automated using Terraform
We have dedicated Terraform objects and variables for each Node Group. I explicitly didn't want to use count because that would make rolling out upgrades too complicated

Follow-ups:

Remove blue/green Terraform variables and objects
Update docs to reflect changes in the upgrade procedure

/assign @upodroid @ameukam @dims

Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>

upodroid

LGTM

k8s-ci-robot · 2024-01-24T15:33:51Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: upodroid, xmudrii

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~infra/aws/terraform/prow-build-cluster/OWNERS~~ [upodroid,xmudrii]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

upodroid · 2024-01-24T15:34:01Z

cancel the hold when you are ready to merge
/hold

xmudrii · 2024-01-24T15:38:10Z

Let's get this merged and I'll create a new PR for follow-ups
/hold cancel

bryantbiggs · 2024-01-24T15:39:24Z

infra/aws/terraform/prow-build-cluster/node_group_us-east-2a.tf

+
+    cluster_version = var.node_group_version_us_east_2a
+
+    taints = var.node_taints_build


FYI - you can use eks_managed_node_group_defaults to set the default values you wish to use across all of the managed nodegroups, and then any specific settings unique to that nodegroup can be set in the nodegroup definition. You can also set the defaults and still override the value in the nodegroup definition as well

So roughly speaking, something like the following might help cut down on the copy+pasta across nodegroup definitions:

eks_managed_node_group_defaults = { use_name_prefix = true taints = var.node_taints_build labels = var.node_labels_build ... anything else that you want common across the nodegroups }

That's neat, thanks for pointing it out! I'll keep it in mind and see if we can make use of it the next time we do some rollout

xmudrii added 4 commits January 24, 2024 16:17

Step 0: Prepare new Managed Node Groups per AZ

9196ba1

Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>

Step 1: Deploy new Managed Node Groups per AZ

86c05c1

Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>

Step 2: Remove legacy managed node groups

eca8156

Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>

Step 3: Cleanup bootstrap scripts

bc1f27f

Signed-off-by: Marko Mudrinić <mudrinic.mare@gmail.com>

k8s-ci-robot assigned ameukam, dims and upodroid Jan 24, 2024

k8s-ci-robot requested review from pkprzekwas and thockin January 24, 2024 15:27

upodroid approved these changes Jan 24, 2024

View reviewed changes

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jan 24, 2024

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 24, 2024

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 24, 2024

bryantbiggs reviewed Jan 24, 2024

View reviewed changes

k8s-ci-robot merged commit 33a5d9d into kubernetes:main Jan 24, 2024
3 checks passed

k8s-ci-robot added this to the v1.30 milestone Jan 24, 2024

xmudrii deleted the eks-mng-per-az branch January 24, 2024 15:49

xmudrii mentioned this pull request Jan 24, 2024

eks-prow-build-cluster: Use dedicated Managed Node Groups (MNGs) per Availability Zone (AZ) (prod rollout) #6325

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

eks-prow-build-cluster: Use dedicated Managed Node Groups (MNGs) per Availability Zone (AZ) #6320

eks-prow-build-cluster: Use dedicated Managed Node Groups (MNGs) per Availability Zone (AZ) #6320

xmudrii commented Jan 24, 2024

upodroid left a comment

k8s-ci-robot commented Jan 24, 2024

upodroid commented Jan 24, 2024

xmudrii commented Jan 24, 2024

bryantbiggs Jan 24, 2024

xmudrii Jan 24, 2024


		cluster_version = var.node_group_version_us_east_2a

		taints = var.node_taints_build

eks-prow-build-cluster: Use dedicated Managed Node Groups (MNGs) per Availability Zone (AZ) #6320

eks-prow-build-cluster: Use dedicated Managed Node Groups (MNGs) per Availability Zone (AZ) #6320

Conversation

xmudrii commented Jan 24, 2024

upodroid left a comment

Choose a reason for hiding this comment

k8s-ci-robot commented Jan 24, 2024

upodroid commented Jan 24, 2024

xmudrii commented Jan 24, 2024

bryantbiggs Jan 24, 2024

Choose a reason for hiding this comment

xmudrii Jan 24, 2024

Choose a reason for hiding this comment