
Add memory test doc for 1.18 #1196

Merged
merged 1 commit into GoogleContainerTools:main on May 7, 2024

Conversation

tiffanny29631
Contributor

No description provided.

@@ -0,0 +1,34 @@
# Config Sync Memory Usage Reduction v1.16 vs v1.18

Config Sync v1.18 contains a change to stop loading OpenAPI for schema validations when the Config Sync admission webhook is not enabled. Here are some test results that show the reduction in memory usage.
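The gist of the change can be shown as a minimal Go sketch; the names below (`newValidator`, `fetchOpenAPISchemas`, `webhookEnabled`) are illustrative and not the actual Config Sync code:

```go
package main

import (
	"errors"
	"fmt"
)

// fetchOpenAPISchemas stands in for the expensive discovery call that
// downloads the cluster's full OpenAPI document (hypothetical name).
func fetchOpenAPISchemas() (map[string][]byte, error) {
	return nil, errors.New("not fetched in this sketch")
}

// newValidator returns a pass-through validator unless the admission webhook
// is enabled; only then does it pay the memory cost of holding OpenAPI schemas.
func newValidator(webhookEnabled bool) (func(obj interface{}) error, error) {
	if !webhookEnabled {
		// The v1.18 behavior described above: skip the OpenAPI download entirely.
		return func(obj interface{}) error { return nil }, nil
	}
	schemas, err := fetchOpenAPISchemas()
	if err != nil {
		return nil, err
	}
	return func(obj interface{}) error {
		_ = schemas // real code would validate obj against these schemas
		return nil
	}, nil
}

func main() {
	validate, _ := newValidator(false) // webhook disabled: no OpenAPI held in memory
	fmt.Println(validate(struct{}{}))  // prints <nil>
}
```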
Contributor

Can you explain the problem first?

How does using OpenAPI increase memory usage? In what use cases?

Contributor

@nan-yu did profiling previously and loading the OpenAPI consumes a sizable fraction of the memory.
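For anyone wanting to reproduce that kind of profiling, a heap profile of the Go process is the usual starting point. A minimal sketch using the standard `runtime/pprof` package (not necessarily the setup used for the numbers in this doc):

```go
package main

import (
	"log"
	"os"
	"runtime"
	"runtime/pprof"
)

func main() {
	// Force a GC first so the heap profile reflects live objects only.
	runtime.GC()

	f, err := os.Create("heap.pprof")
	if err != nil {
		log.Fatal(err)
	}
	defer f.Close()

	// Write the current heap profile; inspect it later with
	// `go tool pprof heap.pprof` to see which allocations dominate.
	if err := pprof.WriteHeapProfile(f); err != nil {
		log.Fatal(err)
	}
}
```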


#### Test Repository

[Config Sync Quickstart](https://github.com/GoogleCloudPlatform/anthos-config-management-samples/tree/main/config-sync-quickstart)
Contributor

How many objects is this syncing? How many different resource types? How many namespaces?

These numbers also affect memory usage. So having tests with varying amounts would help show how much reduction is gained at various scales. Is the % reduction constant or dependent on other variables?

Contributor Author

Within this specific repo there are 30 objects of various kinds for the RootSync; I'll test with a larger count to see if the reduction rate changes. Not sure if the resource type would make a difference, but it's worth a try. Cluster CRDs are the primary cause of the decreased memory usage in this case, and the reduction scales with the CRD load because schema validation processes all of them.
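Since the reduction tracks the number of CRDs on the cluster, a quick way to check that count programmatically is the apiextensions clientset. A minimal sketch, assuming a standard kubeconfig (illustrative only, not part of the test tooling in this doc):

```go
package main

import (
	"context"
	"fmt"
	"log"
	"os"
	"path/filepath"

	apiextensions "k8s.io/apiextensions-apiserver/pkg/client/clientset/clientset"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	kubeconfig := filepath.Join(os.Getenv("HOME"), ".kube", "config")
	cfg, err := clientcmd.BuildConfigFromFlags("", kubeconfig)
	if err != nil {
		log.Fatal(err)
	}
	client, err := apiextensions.NewForConfig(cfg)
	if err != nil {
		log.Fatal(err)
	}

	// List every CRD on the cluster; schema validation has to process all of
	// them, which is why memory usage tracks this count.
	crds, err := client.ApiextensionsV1().CustomResourceDefinitions().List(
		context.Background(), metav1.ListOptions{})
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("cluster has %d CRDs\n", len(crds.Items))
}
```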

Contributor

Does memory usage scale just with the CRDs managed by the current reconciler or also with any CRDs on the cluster?

FWIW, the applier watch cache memory usage increases with any objects on the cluster that are in overlapping resource types and namespaces (it's actually even more complicated than this, because RootSyncs often use cluster-scoped watches). So I'm trying to figure out how isolated your tests are on those dimensions.
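To illustrate the scoping point: with client-go's dynamic informers, passing an empty namespace gives a cluster-scoped list/watch, so the cache holds every matching object on the cluster rather than only the managed ones. A minimal sketch (illustrative only, not the applier's actual code):

```go
package main

import (
	"log"
	"os"
	"path/filepath"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/runtime/schema"
	"k8s.io/client-go/dynamic"
	"k8s.io/client-go/dynamic/dynamicinformer"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	kubeconfig := filepath.Join(os.Getenv("HOME"), ".kube", "config")
	cfg, err := clientcmd.BuildConfigFromFlags("", kubeconfig)
	if err != nil {
		log.Fatal(err)
	}
	client, err := dynamic.NewForConfig(cfg)
	if err != nil {
		log.Fatal(err)
	}

	// metav1.NamespaceAll ("") yields a cluster-scoped watch: the informer
	// cache ends up holding every ConfigMap on the cluster, not just managed ones.
	factory := dynamicinformer.NewFilteredDynamicSharedInformerFactory(
		client, 10*time.Minute, metav1.NamespaceAll, nil)
	gvr := schema.GroupVersionResource{Version: "v1", Resource: "configmaps"}
	informer := factory.ForResource(gvr).Informer()

	stop := make(chan struct{})
	defer close(stop)
	factory.Start(stop)
	factory.WaitForCacheSync(stop)
	log.Printf("cached %d objects", len(informer.GetStore().List()))
}
```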

Contributor Author

The usage in this case scales with any CRDs on the cluster, not just the CRDs managed by the reconciler. With the test CRDs on the Autopilot clusters, the reconciler gets OOM-killed with the default memory request/limit.
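One way to confirm the OOM kill from the API is to check the containers' last termination state. A minimal sketch, assuming the reconcilers run in the `config-management-system` namespace (adjust for your setup):

```go
package main

import (
	"context"
	"fmt"
	"log"
	"os"
	"path/filepath"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	kubeconfig := filepath.Join(os.Getenv("HOME"), ".kube", "config")
	cfg, err := clientcmd.BuildConfigFromFlags("", kubeconfig)
	if err != nil {
		log.Fatal(err)
	}
	client, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		log.Fatal(err)
	}

	// Assumed namespace for the reconciler pods; adjust for your setup.
	pods, err := client.CoreV1().Pods("config-management-system").List(
		context.Background(), metav1.ListOptions{})
	if err != nil {
		log.Fatal(err)
	}
	for _, pod := range pods.Items {
		for _, cs := range pod.Status.ContainerStatuses {
			term := cs.LastTerminationState.Terminated
			if term != nil && term.Reason == "OOMKilled" {
				fmt.Printf("%s/%s was OOMKilled\n", pod.Name, cs.Name)
			}
		}
	}
}
```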

Contributor

How many CRDs can it handle with the default resources on Autopilot?

Does the size of the CRDs matter?

Contributor Author

We could enhance the dataset with your work in #1187 later.

@tiffanny29631
Contributor Author

/retest-required

@janetkuo (Contributor) left a comment:

/lgtm


[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: janetkuo

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@tiffanny29631
Contributor Author

/hold

@janetkuo (Contributor) commented May 7, 2024

This doc is good for now, given that it includes the scope of tests being performed. We can add more types of benchmarking tests later.

@tiffanny29631
Contributor Author

/retest-required

@google-oss-prow google-oss-prow bot merged commit 263bc73 into GoogleContainerTools:main May 7, 2024
3 checks passed
@tiffanny29631 tiffanny29631 deleted the mem-doc branch June 4, 2024 18:40