Proposal: Allow Setting Cache Miss Policy in Cache Options #2397

Closed
stevekuznetsov opened this issue Jul 5, 2023 · 9 comments · Fixed by #2406

Comments

@stevekuznetsov
Contributor

stevekuznetsov commented Jul 5, 2023

Today, when a user gets a client.Client from a Manager (with mgr.GetClient()), they get a cache-backed client that has some surprising behavior - when a cache miss occurs, an informer is spun up to feed data into the cache. This means that one client.Get() call for one resource will, by default, start a cluster-scoped watch on all objects of that type and keep them in memory forever. While this is an understandable way for the cache to work, I've seen it bite folks who did not expect this behavior. While I don't think that this should be the default, at a minimum I'd love to see the cache miss policy be configurable at creation time so that folks can opt out of this behavior. Today it is possible to opt out of caching particular resources, but that's done with a hard-coded deny-list, which means if you ever forget to update that list, you get bitten.
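For context, the deny-list opt-out mentioned above looks roughly like the sketch below (field names assume controller-runtime v0.15, where the per-type cache bypass lives under client.Options; older releases spell the same idea as manager.Options.ClientDisableCacheFor):

package main

import (
    corev1 "k8s.io/api/core/v1"
    ctrl "sigs.k8s.io/controller-runtime"
    "sigs.k8s.io/controller-runtime/pkg/client"
    "sigs.k8s.io/controller-runtime/pkg/manager"
)

func main() {
    // Objects listed in DisableFor are read with live calls instead of the cache,
    // so no informer is ever started for them. Every other type read through
    // mgr.GetClient() is cached: the first Get or List for that type starts a
    // cluster-scoped informer and keeps its objects in memory.
    mgr, err := manager.New(ctrl.GetConfigOrDie(), manager.Options{
        Client: client.Options{
            Cache: &client.CacheOptions{
                DisableFor: []client.Object{&corev1.Secret{}}, // never cache Secrets
            },
        },
    })
    if err != nil {
        panic(err)
    }
    _ = mgr.GetClient()
}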

I'd like to propose adding a new field in the cache.Options:

// pkg/cache/cache.go

// Options are the optional arguments for creating a new InformersMap object.
type Options struct {
    // ...
    
    // MissPolicy determines how the cache should behave when a Get finds no
    // entry in the cache or a List returns no items. See the CacheMissPolicy
    // constants for documentation of each policy.
    MissPolicy CacheMissPolicy
}

type CacheMissPolicy string

const (
    // Forward configures the cache to forward a cache miss to the client, unchanged,
    // meaning that the client gets an errors.NotFound for a Get and an empty response
    // for a List.
    Forward CacheMissPolicy = "forward"

    // Backfill configures the cache to spin up an informer for a resource when the first
    // request for that resource comes in. This means that a Get or a List may take longer
    // than normal to succeed on the first invocation, as the cache waits until it is fully
    // backfilled.
    Backfill CacheMissPolicy = "backfill"

    // LiveLookup configures the cache to issue a live client call to the server for requests
    // that do not correspond to resources that have been explicitly configured to be cached.
    LiveLookup CacheMissPolicy = "liveLookup"
)

We can keep "backfill" as the default so that existing behavior does not change.
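For illustration, opting out of backfilling would then look roughly like this (hypothetical usage; the MissPolicy field and Forward constant only exist if the proposal above lands as written):

package main

import (
    "k8s.io/client-go/rest"
    "sigs.k8s.io/controller-runtime/pkg/cache"
)

// newForwardingCache is a hypothetical sketch of wiring the proposed option:
// cache misses surface to the caller as NotFound / empty List instead of
// starting a new informer.
func newForwardingCache(cfg *rest.Config) (cache.Cache, error) {
    return cache.New(cfg, cache.Options{
        MissPolicy: cache.Forward, // proposed field and constant, not yet real
    })
}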

/cc @alvaroaleman @vincepri

@stevekuznetsov
Contributor Author

@alvaroaleman points out that LiveLookup would have weird semantics with configured label selectors. Likely better to remove that option.

Also, for Forward ... the user needs to be able to distinguish between "not found" and "no resource registered"... so, updated proposal:

type CacheMissPolicy string

const (
    // Fail configures the cache to return a sentinel error when a user requests a resource
    // the cache is not configured to hold. This error is distinct from an errors.NotFound
    // and should be considered a programming error.
    Fail CacheMissPolicy = "fail"

    // Backfill configures the cache to spin up an informer for a resource when the first
    // request for that resource comes in. This means that a Get or a List may take longer
    // than normal to succeed on the first invocation, as the cache waits until it is fully
    // backfilled.
    Backfill CacheMissPolicy = "backfill"
)

Perhaps "fail" could also just runtime panic, since it's a programming error.

@stevekuznetsov
Contributor Author

Perhaps "fail" could also just runtime panic, since it's a programming error.

Just realizing that the better proposal would be that the client.Client wrapping the cache could use that error to do a live lookup.

@vincepri
Member

vincepri commented Jul 6, 2023

Perhaps "fail" could also just runtime panic, since it's a programming error.

Just realizing that the better proposal would be that the client.Client wrapping the cache could use that error to do a live lookup.

Which error?

@vincepri
Member

vincepri commented Jul 6, 2023

Perhaps "fail" could also just runtime panic, since it's a programming error.

Maybe we could have a Strict mode, and have Backfill be the default?

@stevekuznetsov
Contributor Author

@vincepri

Which error?

I'm proposing:

    // Fail configures the cache to return a sentinel error when a user requests a resource
    // the cache is not configured to hold. This error is distinct from an errors.NotFound
    // and should be considered a programming error.
    Fail CacheMissPolicy = "fail"

This sounds like what you mean by Strict mode - yes?

@stevekuznetsov
Contributor Author

So the flows for a client.Client with an underlying cache in CacheMissPolicy = Fail would be:

  1. query cache, get a hit -> return that value
  2. query cache, get a 404 -> return that 404
  3. query cache, get the ResourceNotCached error -> do a live call, return that
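A minimal sketch of that wrapping client's Get, assuming a hypothetical not-cached sentinel like the one sketched above and an uncached reader for the fallback:

package fallback

import (
    "context"
    "errors"

    "sigs.k8s.io/controller-runtime/pkg/client"
)

// errResourceNotCached stands in for the sentinel the cache would return under
// the "fail" policy; the real error would live in pkg/cache.
var errResourceNotCached = errors.New("resource is not cached")

// fallbackReader sketches flows 1-3: cache hit -> cached value, cache 404 -> 404,
// not-cached sentinel -> live call against the API server.
type fallbackReader struct {
    cache client.Reader // the cache, configured with the "fail" policy
    live  client.Reader // an uncached client used only for the fallback
}

func (r *fallbackReader) Get(ctx context.Context, key client.ObjectKey, obj client.Object) error {
    err := r.cache.Get(ctx, key, obj)
    if errors.Is(err, errResourceNotCached) {
        // Flow 3: the cache is not configured for this type; do a live lookup.
        return r.live.Get(ctx, key, obj)
    }
    // Flows 1 and 2: return the cached object or the cache's NotFound unchanged.
    return err
}

The live reader would just be a plain uncached client built against the same rest.Config; whether the fallback is automatic or opt-in is a separate decision.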

@stevekuznetsov
Contributor Author

@vincepri @alvaroaleman do you think this is fleshed-out enough for me to attempt an implementation? Any other feedback?

@vincepri
Member

Yeah I'm generally +1

@alvaroaleman
Member

Yeah, sgtm

Just realizing that the better proposal would be that the client.Client wrapping the cache could use that error to do a live lookup.

That is a good idea, but in the interest of not having too much magic, maybe make it opt-in?
