KUBE-5984 - ensure that scale-ups always occur when there are starved pods #225

triorph · 2023-06-23T01:44:49Z

This fixes #224

The main change here is to add a ScaleOnStarve option to the node group configuration. This boolean enables an additional check on the nodeDelta calculated during the scaling step.

When we gather the RequestedPod, we also record the largest pending pods (one by CPU, one by memory). When we gather the node capacity, we also record the largest available capacity on any single node, where available capacity is the node's allocatable CPU/memory minus the CPU/memory used by its pods. If either of the largest pending pods requests more than that largest available capacity, we have a "starved pod": a pod that no existing node can fit. When a starved pod exists and ScaleOnStarve is enabled, we make sure that the final result of the scaling algorithm is a scale-up of at least 1.
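The check described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the types `Resources` and `NodeState` and the functions `largestPending`, `largestAvailable`, `hasStarvedPod`, and `adjustNodeDelta` are hypothetical names. It also assumes that CPU and memory maxima are tracked independently, and that "at least 1 scale up" means a nodeDelta below 1 is raised to 1.

```go
package main

// Resources is a hypothetical CPU/memory pair (millicores and bytes).
type Resources struct {
	MilliCPU int64
	MemBytes int64
}

// NodeState is a hypothetical summary of one node: what it can allocate
// and what the pods already scheduled on it are using.
type NodeState struct {
	Allocatable Resources
	Used        Resources
}

// largestPending returns the largest CPU request and the largest memory
// request among pending pods. The two maxima may come from different pods.
func largestPending(pods []Resources) Resources {
	var max Resources
	for _, p := range pods {
		if p.MilliCPU > max.MilliCPU {
			max.MilliCPU = p.MilliCPU
		}
		if p.MemBytes > max.MemBytes {
			max.MemBytes = p.MemBytes
		}
	}
	return max
}

// largestAvailable returns the largest free CPU and free memory
// (allocatable minus used) found on any single node. With no nodes,
// both values stay at zero.
func largestAvailable(nodes []NodeState) Resources {
	var max Resources
	for _, n := range nodes {
		if free := n.Allocatable.MilliCPU - n.Used.MilliCPU; free > max.MilliCPU {
			max.MilliCPU = free
		}
		if free := n.Allocatable.MemBytes - n.Used.MemBytes; free > max.MemBytes {
			max.MemBytes = free
		}
	}
	return max
}

// hasStarvedPod reports whether the largest pending request exceeds the
// largest available capacity in either dimension.
func hasStarvedPod(pending, available Resources) bool {
	return pending.MilliCPU > available.MilliCPU || pending.MemBytes > available.MemBytes
}

// adjustNodeDelta forces a scale-up of at least 1 when the option is
// enabled and a starved pod exists; otherwise nodeDelta is unchanged.
func adjustNodeDelta(nodeDelta int, scaleOnStarve, starved bool) int {
	if scaleOnStarve && starved && nodeDelta < 1 {
		return 1
	}
	return nodeDelta
}
```

In this sketch, a nodeDelta that is already greater than 1 passes through untouched, so the check only sets a floor and never reduces a larger scale-up.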

awprice · 2023-06-23T01:51:55Z

This solves #224

If you want this to close the issue, you'll need to use the keywords from here - https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword, e.g. closes/fixes/resolves etc.

awprice

Overall looks really good, just some small nits and clarifications

pkg/k8s/util.go

pkg/controller/controller.go

pkg/k8s/util.go

awprice · 2023-06-23T07:52:21Z

Would be good to see in the description of this PR what the change does to fix the issue - I had to look through the code to see how it addresses the original issue

pkg/controller/controller.go

awprice · 2023-06-27T09:59:26Z

LGTM - thanks for the contribution!

Jacobious52

Looks good. Only that one nit from me, happy to leave decision to you.

pkg/controller/controller.go

KUBE-5984 - ensure that scale-ups always occur when there are starved pods

348fb26

Jacobious52 requested review from awprice and Jacobious52 June 23, 2023 01:48

Bug fixes and golangci-lint

628f205

awprice requested changes Jun 23, 2023

pkg/k8s/util.go (outdated)

pkg/k8s/util.go (outdated)

pkg/controller/controller.go

pkg/k8s/util.go (outdated)

mwalsh2atlassian added 3 commits June 26, 2023 10:14

Address PR comments (except metrics, to do next)

18064cc

Also add a test for scaling up greater than the scaleOnStarve limit

f60382c

Add in metrics

83786cc

hittingray reviewed Jun 25, 2023

pkg/controller/controller.go (outdated)

Add docs for Scale On Starve

32b76ac

awprice previously approved these changes Jun 27, 2023

Jacobious52 previously approved these changes Jun 30, 2023

pkg/controller/controller.go (outdated)

refactor scaleOnStarve check

9ae4707

triorph dismissed stale reviews from Jacobious52 and awprice via 9ae4707 July 2, 2023 21:49

Jacobious52 approved these changes Jul 2, 2023

awprice approved these changes Jul 2, 2023

awprice merged commit b82fb50 into atlassian:master Jul 2, 2023
