
Upgrading cluster master after cluster creation completes #3385

Closed
yellowmegaman opened this issue Apr 5, 2019 · 21 comments
Labels
bug, forward/review, service/container

Comments

@yellowmegaman

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment. If the issue is assigned to the "modular-magician" user, it is either in the process of being autogenerated, or is planned to be autogenerated soon. If the issue is assigned to a user, that user is claiming responsibility for the issue. If the issue is assigned to "hashibot", a community member has claimed the issue already.

Description

Currently, when creating a google_container_cluster the recommended way (with a separate node_pool), Terraform signals that everything is OK, but only a few moments later the cluster starts an upgrade that is not governed by the GKE maintenance window.

Upgrading cluster master
The values shown below are going to change soon.

So if I'm using Terraform to create a GKE cluster, I can't be sure that everything is OK the moment Terraform is done and then proceed with further automation.

Idea: add an additional (configurable) timeout, after which Terraform checks whether the cluster is available.
Currently even the endpoint is unavailable while an upgrade is in progress.

New or Affected Resource(s)

  • google_container_cluster

Potential Terraform Configuration

Any recommended configuration from https://www.terraform.io/docs/providers/google/r/container_cluster.html

References

@ghost ghost added the enhancement label Apr 5, 2019
@rileykarson
Collaborator

Are you able to consistently reproduce this with a config you can share? We run a pretty extensive number of clusters in our CI environment and this has never come up.

@yellowmegaman
Author

@rileykarson
Here you go — an almost-default config, but without certificates/legacy endpoints, embracing RBAC.

resource "google_container_cluster" "dev" {
  name                     = "dev"
  location                 = "europe-west4-b"
  remove_default_node_pool = true
  initial_node_count       = 1
  min_master_version       = "1.12.6-gke.7"
  enable_legacy_abac       = false
  maintenance_policy {
    daily_maintenance_window {
      start_time = "01:00"
    }
  }
  master_auth {
    username = ""
    password = ""
    client_certificate_config {
      issue_client_certificate = false
    }
  }
}
resource "google_container_node_pool" "dev-n1s2-pool" {
  name               = "dev"
  location           = "europe-west4-b"
  cluster            = "${google_container_cluster.dev.name}"
  initial_node_count = "7"
  autoscaling {
    min_node_count   = "4"
    max_node_count   = "14"
  }
  management {
    auto_repair  = "true"
  }
  version        = "1.12.6-gke.7"
  node_config {
    disk_type    = "pd-ssd"
    disk_size_gb = "30"
    metadata {
      disable-legacy-endpoints = "true"
    }
    preemptible  = "false"
    machine_type = "n1-standard-2"
    oauth_scopes = [ "https://www.googleapis.com/auth/compute", "https://www.googleapis.com/auth/devstorage.read_only", "https://www.googleapis.com/auth/logging.write", "https://www.googleapis.com/auth/monitoring" ]
    tags         = ["ssh-wan"]
  }
}

After cluster creation completes, I always get a master node update.

P.S. Version 1.12.5 behaves the same, and 1.11 too.

@ghost ghost removed the waiting-response label Apr 7, 2019
@thefirstofthe300

@yellowmegaman I suspect the issue here is that the master is resizing itself. On cluster creation, the master is generally sized to accommodate only ~5 nodes. Since your cluster has 7, the master is being upsized to accommodate those extra nodes. This autoscaling will happen regardless of what the maintenance window is set to, since the autoscaling is required to maintain the health of the cluster and isn't really routine.

Note that the master currently only scales up, so once the master reaches a certain size, it isn't scaled back down.

If you need an HA control plane, you should probably look into using a regional cluster. Otherwise, I suspect everything here is WAI (working as intended).
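For anyone reading along, a minimal sketch of what a regional cluster could look like (the names and values here are illustrative, not taken from the reporter's config): setting location to a region instead of a zone gives you masters replicated across the region's zones.

resource "google_container_cluster" "dev" {
  name                     = "dev"
  # A region rather than a zone => regional cluster with a replicated control plane.
  location                 = "europe-west4"
  remove_default_node_pool = true
  initial_node_count       = 1
}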

@yellowmegaman
Author

@thefirstofthe300 thanks for shedding light on this! But still, I can't just rely on Terraform bringing up the cluster before continuing with further automation here; I need to implement some kind of timeout and check, and that's what troubles me.

The next best thing I can come up with: the docs should probably mention the possible resizing, since if you use the recommended example with, for instance, a batch job with a timeout, the batch job will fail during the cluster resize.

@ctrox

ctrox commented Apr 9, 2019

@thefirstofthe300 I actually ran into the same thing here: #3249

If you need an HA control plane, you should probably look into using a regional cluster. Otherwise, I suspect everything here is WAI.

This is exactly what I did, but the Terraform provider does not handle this gracefully. The master is working fine on the Kubernetes side, but Terraform fails because it is hardcoded to not continue if the cluster is in the RECONCILING state.

@danisla
Contributor

danisla commented Apr 30, 2019

One of the workarounds I found is to set the initial_node_count to match the expected number of nodes in the managed node pools. This way, the master is already right-sized after the default node pool is deleted and the managed pools are created, so an upgrade operation is not triggered after completion.

Keep in mind when calculating initial_node_count with regional clusters that this value is per zone, so setting it to a value like 2 in a region with 3 zones creates 6 nodes.
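A rough sketch of this workaround for a zonal cluster with a single managed pool (the numbers and names are only illustrative):

resource "google_container_cluster" "dev" {
  name     = "dev"
  location = "europe-west4-b"

  # Size the throwaway default pool to the node count the managed pool will
  # have, so the master is provisioned at the right size from the start.
  initial_node_count       = 7
  remove_default_node_pool = true
}

resource "google_container_node_pool" "dev-pool" {
  name               = "dev"
  location           = "europe-west4-b"
  cluster            = google_container_cluster.dev.name
  initial_node_count = 7  # matches the count used above
}

For a regional cluster, remember the per-zone multiplication mentioned above when picking these numbers.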

@binamov
Contributor

binamov commented Apr 30, 2019 via email

@ctrox

ctrox commented May 2, 2019

Thanks @danisla for that workaround. I got it to work by setting the initial_node_count to whatever I want the final node count to be and setting remove_default_node_pool to true. With that, the masters are sized correctly from the beginning, and deleting the default node pool won't cause the masters to scale down. I'm creating the custom node pools right after that is done (same tf apply), so I'm not sure if a longer wait in between would cause the masters to scale down again.

@singh-ajeet

I am also facing this issue. I created a GKE cluster with the command below, and after increasing the load (using JMeter) on my deployed service, the cluster is doing an upgrade of the master.
gcloud container clusters create ajeet-gke --zone us-east4-b --node-locations us-east4-b --machine-type n1-standard-8 --num-nodes 1 --enable-autoscaling --min-nodes 4 --max-nodes 16

@naihsi

naihsi commented Dec 18, 2019

Hi, I also encountered this issue. In my case I also added two additional node pools, and the cluster upgraded automatically, blocking the subsequent deployment.

@venkykuberan venkykuberan self-assigned this Jan 17, 2020
@venkykuberan
Contributor

I am also facing this issue. I created a GKE cluster with the command below, and after increasing the load (using JMeter) on my deployed service, the cluster is doing an upgrade of the master.
gcloud container clusters create ajeet-gke --zone us-east4-b --node-locations us-east4-b --machine-type n1-standard-8 --num-nodes 1 --enable-autoscaling --min-nodes 4 --max-nodes 16

@singh-ajeet it looks like you are using gcloud, which is not in scope for this channel. Please raise the issue with the gcloud team if you still have the problem.

@venkykuberan
Contributor

@yellowmegaman Please let us know if you still face this issue or whether you want me to close it.
Also, I am able to successfully create the GKE cluster with your config. The cluster completes within the timeout period, and after that I don't see the master getting upgraded.

Terraform v0.12.18
+ provider.google v3.4.0
+ provider.google-beta v3.3.0

@yellowmegaman
Author

@venkykuberan just tried the code above again. I only had to edit the metadata field (add '=') to make it work with 0.12.x.

Everything is just the same. After the cluster and node pool are created, wait a few minutes. In my case it was 2 minutes before the cluster started to upgrade:

[screenshot]

kubectl returns this:

The connection to the server 34.90.187.152 was refused - did you specify the right host or port?

So if I don't use a null_resource to wait some time and check connectivity to the cluster endpoint, Terraform will fail to apply resources to the cluster.

For anyone interested, I'm currently using this code to bypass the issue. It's not ideal and may break on larger node pools due to longer upgrade times:

#%# wait/check
resource "null_resource" "placeholder-wait-for-k8s-sleep" {
  depends_on = [module.placeholder-node-pool]
  provisioner "local-exec" {
    command = "echo 'sleep started'; sleep 180"
  }
}
resource "null_resource" "placeholder-wait-for-k8s" {
  depends_on = [null_resource.placeholder-wait-for-k8s-sleep]
  provisioner "local-exec" {
    command = "until nc -w1 -z ${module.placeholder.gke-cluster.endpoint} 443; do sleep 1; done && echo 'cluster seems to be ready'"
  }
}
#%# wait/check

And then you can add other k8s-related modules here with depends_on = [null_resource.placeholder-wait-for-k8s].
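For example, a downstream resource gated on the check above might look like this (the kubernetes_namespace resource and its provider configuration are assumptions for illustration, not part of the original setup):

resource "kubernetes_namespace" "apps" {
  # Only applied once the endpoint check above has succeeded.
  depends_on = [null_resource.placeholder-wait-for-k8s]

  metadata {
    name = "apps"
  }
}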

@ghost ghost removed the waiting-response label Jan 19, 2020

@venkykuberan
Contributor

@yellowmegaman could you try initial_node_count = 7 in your cluster config and see whether you can avoid the upgrade? With that config, the state was Running for me the whole time (I tried for about 5 minutes) when I hit the cluster from gcloud after cluster creation completed in Terraform.

However, coming to the core issue: once Terraform gets the cluster status as complete, it closes the loop and has no visibility into what GKE is doing on the cluster in the background until the next refresh call finds a difference. If GKE modifies the cluster outside of the maintenance window and that affects your automation flow, an issue can be raised directly against GKE here.

Since you already have a workaround in place, shall I go ahead and close this issue?

@thefirstofthe300

thefirstofthe300 commented Jan 22, 2020 via email

@ghost ghost removed the waiting-response label Jan 22, 2020
@yellowmegaman
Author

@venkykuberan I don't see any reason to try initial_node_count = 7, since all users have different scenarios.
Actually, the same goes for the workaround: it works, but only for this size/count; for larger pools it may not work, at least not with this sleep limit.

I totally understand that this is happening outside Terraform's interaction loop, but I was hoping to bring this issue to the attention of someone developing the GCP provider.

We can close it, since it isn't Terraform's fault, but then we'll have to admit that:
"Terraform can't consistently deploy both a k8s cluster and a workload on top of it."

And that is hugely sad.

@venkykuberan
Contributor

@yellowmegaman I didn't mean to use the constant value 7 for the node count. I wanted you to try the same node_count for the default node pool as well, so that the master is sized correctly at initial creation time (and doesn't have to be resized later).

Also, please let us know: is Terraform successfully creating the cluster and exiting, or is it timing out?

steved added a commit to dominodatalab/terraform-gcp-gke that referenced this issue Feb 11, 2020
After node pools are added, the cluster begins to scale-up and can cause
inaccessibility to the k8s master URL.

Workaround from
hashicorp/terraform-provider-google#3385
@venkykuberan
Contributor

@yellowmegaman do you still want to keep the issue open, or shall I close it if you aren't looking for anything more from us here?

@roaks3 roaks3 added the forward/review In review; remove label to forward label Aug 17, 2023
@edwardmedia
Contributor

No response. Assuming this is no longer an issue.

@github-actions

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Oct 21, 2023