Increase performance test cluster size #3559

Merged
gongmax merged 1 commit into googleforgames:main on Dec 27, 2023

Conversation

gongmax
Collaborator

@gongmax gongmax commented Dec 26, 2023

What type of PR is this?

Uncomment only one /kind <> line, press enter to put that in a new line, and remove leading whitespace from that line:

/kind breaking
/kind bug
/kind cleanup
/kind documentation
/kind feature

/kind hotfix

/kind release

What this PR does / Why we need it:
The performance test timed out because game servers could not be created due to insufficient CPU. This PR increases the test cluster size to fix that.

Which issue(s) this PR fixes:

Closes #

Special notes for your reviewer:

@github-actions github-actions bot added the kind/hotfix (Hotfixes for issues against release) and size/XS labels on Dec 26, 2023
@agones-bot
Collaborator

Build Succeeded 👏

Build Id: 962b94f3-0457-4bdf-9c5b-2b4ccae902bc

The following development artifacts have been built, and will exist for the next 30 days:

A preview of the website (the last 30 builds are retained):

To install this version:

  • git fetch https://github.com/googleforgames/agones.git pull/3559/head:pr_3559 && git checkout pr_3559
  • helm install agones ./install/helm/agones --namespace agones-system --set agones.image.registry=us-docker.pkg.dev/agones-images/ci --set agones.image.tag=1.38.0-dev-1052083-amd64

@@ -53,8 +53,8 @@ module "gke_standard_cluster" {
   location = each.value[0]
   overrideName = format("standard-performance-test-cluster-%s", replace(each.key, ".", "-"))
   releaseChannel = each.value[1]
-  machineType = "n2-standard-2"
-  initialNodeCount = 300
+  machineType = "e2-standard-4"
Member

This is going to double the number of cores in the test cluster. Do you have autoscaling enabled to scale the number of nodes down when it isn't in use?

Collaborator Author

Sorry, I didn't make this clear in the PR description. The current test cluster is actually already using machine type "e2-standard-4"; the code here had drifted from the real configuration by mistake. So the only actual change in this PR is increasing the node count to 310.
Good point about enabling autoscaling. I will test it and send a quick follow-up PR.
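
To put rough numbers on this exchange, here is a small sketch of the vCPU arithmetic written as Terraform locals. The per-node vCPU counts (2 for n2-standard-2, 4 for e2-standard-4) follow from the machine type definitions. All local names are hypothetical and are not part of the Agones Terraform modules.

```hcl
locals {
  # vCPUs per node for the two machine types mentioned in the diff.
  vcpus_per_node = {
    "n2-standard-2" = 2
    "e2-standard-4" = 4
  }

  # What the Terraform code declared before this PR: 300 * 2 = 600 vCPUs.
  declared_before_vcpus = 300 * local.vcpus_per_node["n2-standard-2"]

  # What the cluster was actually running, per the author's reply: 300 * 4 = 1200 vCPUs.
  actual_before_vcpus = 300 * local.vcpus_per_node["e2-standard-4"]

  # After this PR: 310 * 4 = 1240 vCPUs.
  after_vcpus = 310 * local.vcpus_per_node["e2-standard-4"]
}
```

Read against the declared values, the diff looks like a jump from 600 to 1240 vCPUs. Against the cluster as actually deployed, it is a change from 1200 to 1240 vCPUs, i.e. ten extra nodes.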

-  machineType = "n2-standard-2"
-  initialNodeCount = 300
+  machineType = "e2-standard-4"
+  initialNodeCount = 310
Member

To piggyback on the above point, if we enable node autoscaling, then we should adjust the max number of allowed nodes as the initial node count wouldn't make any difference.
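
For illustration only, here is a minimal sketch of node autoscaling on a plain `google_container_node_pool` resource from the Google Terraform provider. This is not the interface of the Agones `gke_standard_cluster` module, and the resource name, cluster name, location, and node limits are all placeholders assumed for the example.

```hcl
# Hypothetical node pool; names, location, and limits are assumptions, not the real test setup.
resource "google_container_node_pool" "performance_test" {
  name     = "performance-test-pool"
  cluster  = "standard-performance-test-cluster"
  location = "us-central1"

  # Only applies when the pool is first created; the autoscaler manages the size afterwards.
  initial_node_count = 1

  autoscaling {
    # Lower bound the pool can shrink to when the tests are not running.
    min_node_count = 1
    # Upper bound; with autoscaling on, this is the value that caps cluster size.
    max_node_count = 310
  }

  node_config {
    machine_type = "e2-standard-4"
  }
}
```

With a block like this, raising capacity for the performance test would mean adjusting `max_node_count`, while `initial_node_count` stops mattering once the autoscaler is in control.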

@roberthbailey
Member

As a hotfix this seems reasonable, but I'd like to make sure we don't have a bunch of idle nodes sitting around when the tests aren't running.

@gongmax gongmax merged commit b74169c into googleforgames:main Dec 27, 2023
4 checks passed
Labels
kind/hotfix Hotfixes for issues against release size/XS
3 participants