Backport upstream changes to watch cache enablement #16398

smarterclayton · 2017-09-17T03:50:39Z

Disables the watch cache for most resources by default, except those accessed by many clients. This has been shown to have minor impacts on the production workload.

Fixes #16112

stevekuznetsov · 2017-09-17T04:44:51Z

/unassign

smarterclayton · 2017-09-18T14:19:28Z

/retest

Backport the change that allows a global default watch cache size as well as being able to disable an individual watch cache item

Remove some complexity in RESTOptionsGetter and add default watch cache sizes for resources that are read by nodes.

Any resource named by the heuristics gets a watch cache by default. Admins can restore the previous behavior by setting `--default-watch-cache-size` to a positive integer. This reduces the amount of total memory allocated on large cluster significantly at minor cost in CPU on the etcd process and an increase in network bandwidth to etcd.

smarterclayton · 2017-09-20T00:08:13Z

/retest

deads2k · 2017-09-20T17:09:43Z

pkg/cmd/server/kubernetes/master/master_config.go

@@ -150,7 +148,7 @@ func BuildKubeAPIserverOptions(masterConfig configapi.MasterConfig) (*kapiserver
 	server.Etcd.StorageConfig.KeyFile = masterConfig.EtcdClientInfo.ClientCert.KeyFile
 	server.Etcd.StorageConfig.CertFile = masterConfig.EtcdClientInfo.ClientCert.CertFile
 	server.Etcd.StorageConfig.CAFile = masterConfig.EtcdClientInfo.CA
-	server.Etcd.DefaultWatchCacheSize = DefaultWatchCacheSize
+	server.Etcd.DefaultWatchCacheSize = 0


This is what is setting us to "off by default", right?

deads2k · 2017-09-20T17:10:26Z

pkg/cmd/server/kubernetes/master/master_config.go

@@ -507,6 +505,20 @@ func buildKubeApiserverConfig(
 		return originLongRunningRequestRE.MatchString(r.URL.Path) || kubeLongRunningFunc(r, requestInfo)
 	}

+	if apiserverOptions.Etcd.EnableWatchCache {
+		glog.V(2).Infof("Initializing cache sizes based on %dMB limit", apiserverOptions.GenericServerRunOptions.TargetRAMMB)
+		sizes := cachesize.NewHeuristicWatchCacheSizes(apiserverOptions.GenericServerRunOptions.TargetRAMMB)


do we set this target RAMMB to anything by default?

do we set this target RAMMB to anything by default?

I'm not seeing where we write a default here, which would set all the heuristic ones to zero by default, right?

There's a "min" function on the heuristic so we always get something even at 0

smarterclayton · 2017-09-23T14:42:53Z

Anything else?

smarterclayton · 2017-09-25T18:33:40Z

ping @deads2k

deads2k · 2017-09-25T18:46:01Z

/lgtm

openshift-merge-robot · 2017-09-25T18:46:20Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k, smarterclayton

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

~~OWNERS~~ [deads2k,smarterclayton]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

openshift-merge-robot · 2017-09-26T02:00:01Z

/test all [submit-queue is verifying that this PR is safe to merge]

openshift-merge-robot · 2017-09-26T02:23:39Z

Automatic merge from submit-queue (batch tested with PRs 16546, 16398, 16157)

openshift-ci-robot · 2017-09-26T03:16:47Z

@smarterclayton: The following test failed, say /retest to rerun them all:

Test name	Commit	Details	Rerun command
ci/openshift-jenkins/extended_conformance_gce	`01aeb23`	link	`/test extended_conformance_gce`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

jeremyeder · 2017-09-26T17:20:04Z

@openshift/svt

openshift-merge-robot assigned stevekuznetsov and deads2k Sep 17, 2017

openshift-merge-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 17, 2017

openshift-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Sep 17, 2017

openshift-ci-robot unassigned stevekuznetsov Sep 17, 2017

smarterclayton force-pushed the disablecache branch from 38ea496 to ef6d782 Compare September 17, 2017 17:44

smarterclayton added 3 commits September 18, 2017 14:09

UPSTREAM: 52112: Allow watch cache disablement per type

25e81b6

Backport the change that allows a global default watch cache size as well as being able to disable an individual watch cache item

React to changes in watch cache initialization

929dc82

Remove some complexity in RESTOptionsGetter and add default watch cache sizes for resources that are read by nodes.

smarterclayton force-pushed the disablecache branch from ef6d782 to 01aeb23 Compare September 18, 2017 18:09

deads2k reviewed Sep 20, 2017

View reviewed changes

openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Sep 25, 2017

openshift-merge-robot merged commit fe04a6f into openshift:master Sep 26, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backport upstream changes to watch cache enablement #16398

Backport upstream changes to watch cache enablement #16398

smarterclayton commented Sep 17, 2017

stevekuznetsov commented Sep 17, 2017

smarterclayton commented Sep 18, 2017

smarterclayton commented Sep 20, 2017

deads2k Sep 20, 2017

smarterclayton Sep 20, 2017

deads2k Sep 20, 2017

deads2k Sep 20, 2017

smarterclayton Sep 20, 2017

smarterclayton commented Sep 23, 2017

smarterclayton commented Sep 25, 2017

deads2k commented Sep 25, 2017

openshift-merge-robot commented Sep 25, 2017

openshift-merge-robot commented Sep 26, 2017

openshift-merge-robot commented Sep 26, 2017

openshift-ci-robot commented Sep 26, 2017 •

edited

Loading

jeremyeder commented Sep 26, 2017

Backport upstream changes to watch cache enablement #16398

Backport upstream changes to watch cache enablement #16398

Conversation

smarterclayton commented Sep 17, 2017

stevekuznetsov commented Sep 17, 2017

smarterclayton commented Sep 18, 2017

smarterclayton commented Sep 20, 2017

deads2k Sep 20, 2017

Choose a reason for hiding this comment

smarterclayton Sep 20, 2017

Choose a reason for hiding this comment

deads2k Sep 20, 2017

Choose a reason for hiding this comment

deads2k Sep 20, 2017

Choose a reason for hiding this comment

smarterclayton Sep 20, 2017

Choose a reason for hiding this comment

smarterclayton commented Sep 23, 2017

smarterclayton commented Sep 25, 2017

deads2k commented Sep 25, 2017

openshift-merge-robot commented Sep 25, 2017

openshift-merge-robot commented Sep 26, 2017

openshift-merge-robot commented Sep 26, 2017

openshift-ci-robot commented Sep 26, 2017 • edited Loading

jeremyeder commented Sep 26, 2017

openshift-ci-robot commented Sep 26, 2017 •

edited

Loading