
Karpenter constantly scales-up for a single pod #2821

Closed · jonathan-innis opened this issue Nov 8, 2022 · 4 comments
Assignee: jonathan-innis
Labels: bug (Something isn't working)

@jonathan-innis (Contributor) commented Nov 8, 2022

Version

Karpenter Version: v0.16.3

Expected Behavior

Karpenter should only provision a single node for the pod that needs to be scheduled on the cluster.

Actual Behavior

Karpenter provisions a node, but the pod never schedules to it, so Karpenter continually provisions new nodes every 20s.

Steps to Reproduce the Problem

Unclear what the repro steps are at this time.

Resource Specs and Logs

Deployment Spec

apiVersion: apps/v1
kind: Deployment
metadata:
  name: app
  labels:
    imageTag: "7a3e144"
    app: app
    chart: apiHelm-0.1.0
    heritage: "Helm"
    release: "app"
spec:
  replicas: 1
  selector:
    matchLabels:
      app: app
  template:
    metadata:
      annotations:
        checksum/config: 95ad22c90ba988c45e8501e4be7583b090ba71aaa3dc7ff1c6517b472dd1f29f
        checksum/external-secrets: 5bf89530b7f9a34d97c64e8149f5ed441fe1a01ae708582c06a1a09c55ef81a8
      labels:
        app: app
        release: "app"
    spec:
      serviceAccountName: eng-18041-app
      containers:
      - name: app
        image: "*********.dkr.ecr.ap-southeast-1.amazonaws.com/app:7a3e144"
        imagePullPolicy: Always
        command:
        - "dumb-init"
        - "bundle"
        - "exec"
        - "rails"
        - "s"
        - "-b"
        - "[::]"
        envFrom:
        - configMapRef:
            name: "app-config"
        - secretRef:
            name: app-parameter
        ports:
        - name: http
          containerPort: 3000
          protocol: TCP
        resources:
          limits:
            memory: 512Mi
          requests:
            cpu: 500m
            memory: 512Mi
        readinessProbe:
          httpGet:
            path: /health_check
            port: http
          initialDelaySeconds: 0
          periodSeconds: 1
          timeoutSeconds: 1
          successThreshold: 1
          failureThreshold: 3
      nodeSelector:
        kubernetes.io/arch: "amd64"
      topologySpreadConstraints:
      - maxSkew: 1
        topologyKey: "kubernetes.io/hostname"
        whenUnsatisfiable: ScheduleAnyway
        labelSelector:
          matchLabels:
            app: app

[Four screenshots attached: Nov 3, 2022 1:06 PM, 1:53 PM (x2); Nov 4, 2022 3:06 AM]

jonathan-innis added the bug (Something isn't working) label on Nov 8, 2022
jonathan-innis self-assigned this on Nov 8, 2022
@ellistarn (Contributor) commented Nov 8, 2022

Can we add an integration test that enforces this? A runaway-scaling test? Maybe a separate suite?

@jonathan-innis (Contributor, Author) commented:

> Can we add an integration test that enforces this? A runaway-scaling test? Maybe a separate suite?

This fits into the category of chaos/failure testing to me. I was planning to create a chaos-testing Describe block in E2ETesting.

@jonathan-innis (Contributor, Author) commented:

I was able to reproduce a hypothesis for the runaway scaling issue on Karpenter v0.16.3 by continually tainting the node after launch with the following script:

# Continually re-taint every Karpenter-provisioned node so the pending pod can never schedule to it.
# --no-headers keeps the column header out of the pipeline; --overwrite makes repeated tainting idempotent.
while true
do
    kubectl get nodes --no-headers --selector karpenter.sh/provisioner-name | cut -d " " -f 1 | xargs -I "{}" kubectl taint node {} special=true:NoExecute --overwrite
    sleep 1
done

This issue was resolved by #2614, which removed the stabilization window that would have prevented empty node removal during the infinite scale-up. This change will be released as part of v0.19.0 and should mitigate the issue.

Provisioners that were using ttlSecondsAfterEmpty would not have been impacted by this runaway scale-up, since ttlSecondsAfterEmpty involves no stabilization window when nodes are considered for deletion.

It's worth noting that without enabling either ttlSecondsAfterEmpty or consolidation.enabled, this issue persists for nodes that receive taints Karpenter is unaware of after node launch.
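
For reference, a minimal sketch of a v1alpha5 Provisioner with empty-node cleanup enabled; the provisioner name and TTL value are placeholders for illustration, not values taken from this cluster:

# Hypothetical Provisioner showing the two cleanup options mentioned above.
# ttlSecondsAfterEmpty and consolidation are mutually exclusive; either one
# lets Karpenter reclaim the empty nodes it launches during the runaway scale-up.
apiVersion: karpenter.sh/v1alpha5
kind: Provisioner
metadata:
  name: default                # placeholder name
spec:
  requirements:
    - key: kubernetes.io/arch
      operator: In
      values: ["amd64"]
  ttlSecondsAfterEmpty: 30     # remove nodes that stay empty for 30 seconds
  # consolidation:
  #   enabled: true            # alternative to ttlSecondsAfterEmpty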

@jonathan-innis (Contributor, Author) commented:

Tracked down the issue: the problem is that max-pods == 110 is set in the userData, but Karpenter is unaware of that value. Karpenter does its scheduling calculations based on ENI_LIMITED_POD_DENSITY, which for the instance type it launches (t4g.small) is 11 (see https://karpenter.sh/v0.18.1/aws/instance-types/#t4gsmall). That pod count is used to calculate kube-reserved for the node being launched, so Karpenter assumes the node will have a larger allocatable capacity than it actually does (Bottlerocket calculates allocatable based on the max-pods value of 110). The node therefore comes up with less allocatable capacity than Karpenter expected, the pending pod still does not fit, and Karpenter launches yet another node, repeating indefinitely.

The solution here is to migrate the max-pods and cluster-dns-ip settings from the userData to spec.kubeletConfiguration on the Provisioner so that Karpenter is aware of them.
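
For illustration, a minimal sketch of what that Provisioner change could look like, assuming the Karpenter version in use exposes maxPods and clusterDNS under spec.kubeletConfiguration; the cluster DNS IP and provisioner name below are placeholders rather than values from this cluster:

apiVersion: karpenter.sh/v1alpha5
kind: Provisioner
metadata:
  name: default                   # placeholder name
spec:
  kubeletConfiguration:
    maxPods: 110                  # previously passed as max-pods in userData
    clusterDNS: ["10.100.0.10"]   # placeholder value; previously cluster-dns-ip in userData
  requirements:
    - key: kubernetes.io/arch
      operator: In
      values: ["amd64"]

With these values declared on the Provisioner, Karpenter's kube-reserved and allocatable calculations use the same pod density the node actually boots with, so the launched capacity matches the scheduling simulation and the repeated scale-up stops.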
