Perma-diff due to new location_policy being repeatedly unset #1478

jawnsy · 2022-11-24T00:28:37Z

TL;DR

The new autoscaling location_policy setting is set to null to avoid issues with pre-1.24 clusters. However, this results in a perma-diff because the provider repeatedly tries to change ANY to null.

Expected behavior

I expected upgrading module versions without modifying my infrastructure not to require any changes to infrastructure, or if it required changes, then I expected it to apply the change as a one-time upgrade migration.

It would be nice if the module could supply the appropriate location policy according to the control plane version (for post-1.24 clusters, supply ANY, otherwise, supply null), but this logic may belong in the provider instead.
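A minimal sketch of that version-based logic, assuming "post-1.24" means control plane version 1.24 or later. The variable `kubernetes_version` and the local names are hypothetical and are not taken from the module:

```hcl
# Hypothetical input: the control plane version string.
variable "kubernetes_version" {
  type    = string
  default = "1.24.5-gke.600"
}

locals {
  # Split "1.24.5-gke.600" into ["1", "24", "5-gke", "600"]; only the
  # major and minor components are used.
  version_parts = split(".", var.kubernetes_version)
  version_major = tonumber(local.version_parts[0])
  version_minor = tonumber(local.version_parts[1])

  # Assumption: 1.24 itself counts as "post-1.24".
  supports_location_policy = (
    local.version_major > 1 ||
    (local.version_major == 1 && local.version_minor >= 24)
  )

  # Supply ANY where the setting is supported, otherwise leave it unset.
  location_policy = local.supports_location_policy ? "ANY" : null
}
```

This only covers version strings that start with numeric major and minor components. As noted above, the logic may belong in the provider rather than in the module.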

Observed behavior

This may be an issue with the provider, but the symptom is that every plan tries to unset location_policy, which then goes back to its default of ANY:

  ~ resource "google_container_node_pool" "pools" {
        name                        = "pool-69f6"
        # (10 unchanged attributes hidden)

      ~ autoscaling {
          - location_policy      = "ANY" -> null
            # (4 unchanged attributes hidden)
        }

        # (5 unchanged blocks hidden)
    }

This is a perma-diff that #1452 was trying to fix.

Terraform Configuration

n/a

Terraform Version

Terraform v1.3.4
on darwin_arm64
+ provider registry.terraform.io/hashicorp/google v4.42.0
+ provider registry.terraform.io/hashicorp/google-beta v4.44.1
+ provider registry.terraform.io/hashicorp/kubernetes v2.16.0
+ provider registry.terraform.io/hashicorp/random v3.4.3

Additional information

The workaround/solution is for users to explicitly set a location_policy to ANY if they are using a post-1.24 cluster, or null otherwise.
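For reference, this is roughly where the setting ends up on the underlying resource. This is a sketch only: the cluster name and location are hypothetical placeholders, and the node counts are illustrative.

```hcl
resource "google_container_node_pool" "example" {
  name     = "pool"
  cluster  = "example-cluster" # hypothetical cluster name
  location = "us-central1"     # hypothetical location

  autoscaling {
    min_node_count = 0
    max_node_count = 10

    # Set explicitly so the configuration matches the value that the
    # plan above shows being removed ("ANY" -> null).
    location_policy = "ANY"
  }
}
```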


marcleibold · 2022-12-08T15:56:37Z

Hi @jawnsy ,

I have the same problem, except that mine keeps changing away from BALANCED.

  ~ resource "google_container_node_pool" "pools" {
        name                        = "default-node-pool"
        # (10 unchanged attributes hidden)

      ~ autoscaling {
          - location_policy      = "BALANCED" -> null
            # (4 unchanged attributes hidden)
        }

        # (5 unchanged blocks hidden)
    }

Do you know where exactly to set location_policy for the workaround, so that I can resolve the perma-diff locally?

marcleibold · 2022-12-08T16:49:21Z

Never mind, I just tried to set it in the module definition via:

cluster_autoscaling = {
    enabled             = false
    autoscaling_profile = null
    min_cpu_cores       = 0
    max_cpu_cores       = 0
    min_memory_gb       = 0
    max_memory_gb       = 0
    gpu_resources       = []
}

But this line prevents the module from picking up the change. That workaround doesn't work.

I also tried setting it to something like "", but then I just get the following error message:

╷
│ Error: expected cluster_autoscaling.0.autoscaling_profile to be one of [BALANCED OPTIMIZE_UTILIZATION], got 
│ 
│   with module.clickhouse.module.gke.module.gke.google_container_cluster.primary,
│   on .terraform/modules/clickhouse.gke.gke/modules/beta-private-cluster/cluster.tf line 107, in resource "google_container_cluster" "primary":
│  107:     autoscaling_profile = var.cluster_autoscaling.autoscaling_profile != null ? var.cluster_autoscaling.autoscaling_profile : "BALANCED"
│ 
╵

So this needs to be changed in the source code first, if I am not completely mistaken. If I am, feel free to correct me.

jawnsy · 2022-12-08T20:27:42Z

@marcleibold Here's what I'm using for my node pool setting:

node_pools = [
  {
    name               = "pool"
    preemptible        = false
    spot               = false
    enable_secure_boot = true
    enable_gcfs        = true
    machine_type       = "t2d-standard-16"
    initial_node_count = 1
    min_count          = 0
    max_count          = 10
    max_surge          = 4
    image_type         = "COS_CONTAINERD"
    location_policy    = "ANY"
  },
]

You just have to add location_policy to your node pool config (ANY or BALANCED). By default, location_policy is null, but Google Cloud sets it to a value for post-1.24 clusters, which results in the perma-diff. The setting is a node pool setting, not a cluster setting: https://cloud.google.com/kubernetes-engine/docs/how-to/cluster-autoscaler#location_policy
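For the BALANCED case reported above, the same approach would look roughly like this. It is a sketch: the pool name comes from the plan output earlier in the thread, while min_count and max_count are illustrative values, not taken from that configuration.

```hcl
node_pools = [
  {
    name      = "default-node-pool"
    min_count = 0  # illustrative
    max_count = 10 # illustrative

    # Match the value the plan shows being removed ("BALANCED" -> null)
    # so that Terraform no longer sees a difference.
    location_policy = "BALANCED"
  },
]
```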

bharathkkb · 2022-12-09T05:38:02Z

Thanks for the report, @jawnsy.
This looks like a provider bug. IIUC, when location_policy defaults to null, it should behave as if location_policy were not set at all and be managed by the provider.

gleichda · 2022-12-21T07:46:07Z

This was fixed in GoogleCloudPlatform/magic-modules#6982

github-actions · 2023-02-19T23:15:01Z

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days

jawnsy added the bug label Nov 24, 2022

github-actions bot added the Stale label Feb 19, 2023

github-actions bot closed this as not planned Feb 27, 2023
