feat: make id_token mutator cache configurable #1177

David-Wobrock · 2024-08-09T14:51:28Z

Make the id_token mutator cache configurable:

can be enabled/disabled
can set the max_cost

Changes to default configuration:
Previous:

NumCounters: 10000
MaxCost: 1 << 25

New:

NumCounters: maxCost * 10
MaxCost: 1 << 25
Cost function: JWT length

Related issue(s)

Follow up of #1171 and #1209 (and #1210 too a bit).

Related docs PR: ory/docs#1820

Checklist

I have read the contributing guidelines.
I have referenced an issue containing the design document if my change
introduces a new feature.
I am following the
contributing code guidelines.
I have read the security policy.
I confirm that this pull request does not address a security
vulnerability. If this pull request addresses a security vulnerability, I
confirm that I got the approval (please contact
security@ory.sh) from the maintainers to push
the changes.
I have added tests that prove my fix is effective or that my feature
works.
I have added or changed the documentation.

Further Comments

Could probably be subject to a minor version bump, since there's a behaviour change.

David-Wobrock · 2024-08-09T15:11:19Z

pipeline/mutate/mutator_id_token.go

+			BufferItems: 64,
+			Cost: func(value interface{}) int64 {
+				return 1
+			},
+			IgnoreInternalCost: true,


To be discussed if this is really the best strategy for id_token.

I'm less familiar with other use cases of generated id_token and also how ristretto computes the "cost".

For now, this is copied from the AuthN OAuth2 introspection handler.
But perhaps saying that each id_token = 1 cost would cache too many id_tokens.

I'd like @aeneasr's opinion here :)

In the current state it does not work, but we could at least do

NumCounters: 10000, BufferItems: 64, MaxCost: cost,

to keep the same behaviour as previously.

cost = 1 << 25

as the default is definitely too many now that the cost function is 1

Since JWTs can be quite long it may make more sense to calculate the cost based on the string length to have an approximation of storage use

That could make sense indeed 👍
Then the default limit of 1 << 25 (~33.5 million), if we estimate an average length of JWT of a few thousands char, we should be able to store some tens of thousands of keys - which sounds reasonable.

pipeline/mutate/mutator_id_token.go

alnr · 2024-09-13T11:47:36Z

spec/config.schema.json

+        "cache": {
+          "additionalProperties": false,
+          "type": "object",
+          "properties": {


We have a bunch of other caches in the config schema already. Can you make your change so that it is more similar (identical) to those other cache configurations? The max_cost parameter in particular is really opaque and its impossible to come up with a reasonable value without knowing the implementation.

I agree with the remark on max_cost 👍
A value with similar semantics is used today for the OAuth2 introspection authenticator, so I mainly mimicked this one.
Else, there is also max_tokens, used by the OAuth2 client credentials authenticator.

I think the reason for the cost instead comes from the fact that the cached objects have variable lengths, so storing a certain number of objects will result in a different cache memory usage depending on your config.

However, one can also make the decision to let the user make this decision anyway :)
Let me know what makes most sense to you, and Ory's strategy around these questions (should the product have the same defaults for everyone, or is the user trusted to configure this accordingly).

For the enabled/disabled value, I didn't re-use the existing

"handlerSwitch": { "title": "Enabled", "type": "boolean", "default": false, "examples": [true], "description": "En-/disables this component." },

to avoid introducing a breaking change.

Currently, the id_token cache is enabled by default, and didn't wanna change the default value to false - so I couldn't re-use this configuration.

And finally, this cache config has no TTL, because the id_token mutator already has a TTL config value for the JWT expiration date.
It make sense to me to re-use the same value => cache for 15 min if the JWT is valid 15min.

Ideally, we would probably have one namespaced cache for all of oathkeeper and then use that everywhere, but this is good for now!

alnr · 2024-09-13T11:48:46Z

pipeline/mutate/mutator_id_token.go

+			Token:     token,
+		},
+		0,
+		ttl,


In other caches, I believe we set the TTL to min(TTL, time.Until(expiresAt). That would make sense here too IMO.

They should be same normally. Since calling this method does:

now := time.Now().UTC() exp := now.Add(ttl) [...] a.tokenToCache(c, session, templateClaims, ttl, exp, signed)

Or what am I missing? :)

pipeline/mutate/mutator_id_token.go

aeneasr

This only changes cache sizes and not the actual caching function itself, right? If so I think we’re very close!

David-Wobrock · 2024-09-16T08:12:56Z

This only changes cache sizes and not the actual caching function itself, right? If so I think we’re very close!

I'm glad to read this 😁

At the time of writing this patch does:

behaviour changes:
- set a TTL on keys in id_token mutator cache
- set a cost function in the id_token mutator cache
new configuration:
- allow disabling id_token mutator cache
- allow changing the cost of id_okten mutator cache

David-Wobrock · 2024-10-28T09:32:07Z

Hey @aeneasr I hope you're well :)

Did you get a chance to have a look again? 😇
We are still running our forked and patched Oathkeeper, but we would obviously prefer running the Ory upstream version.

Perhaps this can make it into the next version?

David-Wobrock · 2024-12-13T15:53:43Z

Hello @aeneasr @alnr,

A pity to see that we didn't make it into the latest Oathkeeper release 😞

Feel free to leave additional review to push this over the finish line 💪 I think we are not far 😄

David-Wobrock · 2024-12-28T11:52:20Z

pipeline/mutate/mutator_id_token.go

+		cost = 1 << 25
+	}
+
+	if a.tokenCache == nil || a.tokenCache.MaxCost() != cost {


The a.tokenCache.MaxCost() != cost condition is mainly for unit tests, in order to be able to test other max_cost values on the cache config.

On prod envs, this should not happen, and we'll keep skipping this branch.

So basically this does hot reloading if the cost changes?

aeneasr · 2025-01-02T09:49:19Z

pipeline/mutate/mutator_id_token.go

+
+	if a.tokenCache == nil || a.tokenCache.MaxCost() != cost {
+		cache, err := ristretto.NewCache(&ristretto.Config[string, *idTokenCacheContainer]{
+			NumCounters: cost * 10,


So counters should actually be 10 * number of items. Since the cost no longer == number of items, it's not trivial to set this. But this is probably too high?

Yeah, being able to estimate the number of JWTs that we can fit in the given max size relates to the average size of JWTs. But it's gonna be tough to get an "average JWT size" when the payload is configurable 😅

What would be a better guess in your opinion?
I pushed this for now, which would leave a bit less extra space through the number of counters.

Suggested change

NumCounters: cost * 10,

NumCounters: cost * 4,

.schema/config.schema.json

aeneasr

Generally LGTM, a few comments

aeneasr · 2025-01-03T09:13:09Z

Looks like we're now failing some cache tests: https://github.com/ory/oathkeeper/actions/runs/12588707987/job/35087289005?pr=1177

David-Wobrock · 2025-01-03T10:41:11Z

Looks like we're now failing some cache tests: https://github.com/ory/oathkeeper/actions/runs/12588707987/job/35087289005?pr=1177

I think we're good again 🙂

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 6146085 to 6ad3106 Compare August 9, 2024 15:01

David-Wobrock mentioned this pull request Aug 9, 2024

feat: add id_token mutator cache config ory/docs#1820

Merged

6 tasks

David-Wobrock commented Aug 9, 2024

View reviewed changes

David-Wobrock marked this pull request as ready for review August 9, 2024 15:11

David-Wobrock requested a review from aeneasr as a code owner August 9, 2024 15:11

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 6ad3106 to 7a46fd0 Compare August 22, 2024 11:57

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 7a46fd0 to 2b89d1e Compare August 29, 2024 12:56

David-Wobrock commented Aug 29, 2024

View reviewed changes

pipeline/mutate/mutator_id_token.go Outdated Show resolved Hide resolved

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 2f91f5e to e78b048 Compare September 13, 2024 07:44

alnr requested changes Sep 13, 2024

View reviewed changes

aeneasr reviewed Sep 14, 2024

View reviewed changes

David-Wobrock requested review from aeneasr and alnr September 20, 2024 12:28

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 8c5806e to d48aa10 Compare October 8, 2024 07:44

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from d48aa10 to 4229ee6 Compare October 28, 2024 09:21

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch 2 times, most recently from 9c749a7 to 5ffc3e7 Compare November 11, 2024 12:30

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 5ffc3e7 to a24b0a2 Compare December 13, 2024 15:02

David-Wobrock mentioned this pull request Dec 28, 2024

fix: memory leak in id_token mutator cache #1209

Merged

7 tasks

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch 3 times, most recently from 95d5a75 to d2b9074 Compare December 28, 2024 11:43

David-Wobrock commented Dec 28, 2024

View reviewed changes

aeneasr reviewed Jan 2, 2025

View reviewed changes

.schema/config.schema.json Outdated Show resolved Hide resolved

aeneasr reviewed Jan 2, 2025

View reviewed changes

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from d2b9074 to e390ea5 Compare January 2, 2025 11:04

David-Wobrock requested a review from a team as a code owner January 2, 2025 11:04

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch 2 times, most recently from 124120c to 30c02fa Compare January 2, 2025 21:01

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 30c02fa to d1a6359 Compare January 3, 2025 10:30

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from d1a6359 to 9075a49 Compare January 22, 2025 10:24

feat: make id_token mutator cache configurable

461e6a0

David-Wobrock force-pushed the feat/id-token-mutator-cache-configurable branch from 9075a49 to 461e6a0 Compare February 7, 2025 09:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: make id_token mutator cache configurable #1177

feat: make id_token mutator cache configurable #1177

David-Wobrock commented Aug 9, 2024 •

edited

Loading

David-Wobrock Aug 9, 2024

David-Wobrock Aug 26, 2024

aeneasr Aug 28, 2024

aeneasr Aug 28, 2024

David-Wobrock Aug 29, 2024

alnr Sep 13, 2024

David-Wobrock Sep 13, 2024

aeneasr Sep 14, 2024

alnr Sep 13, 2024

David-Wobrock Sep 13, 2024

aeneasr Sep 14, 2024

aeneasr left a comment

David-Wobrock commented Sep 16, 2024

David-Wobrock commented Oct 28, 2024

David-Wobrock commented Dec 13, 2024

David-Wobrock Dec 28, 2024

aeneasr Jan 2, 2025

aeneasr Jan 2, 2025

David-Wobrock Jan 2, 2025

aeneasr left a comment

aeneasr commented Jan 3, 2025

David-Wobrock commented Jan 3, 2025

feat: make id_token mutator cache configurable #1177

Are you sure you want to change the base?

feat: make id_token mutator cache configurable #1177

Conversation

David-Wobrock commented Aug 9, 2024 • edited Loading

Related issue(s)

Checklist

Further Comments

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aeneasr left a comment

Choose a reason for hiding this comment

David-Wobrock commented Sep 16, 2024

David-Wobrock commented Oct 28, 2024

David-Wobrock commented Dec 13, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aeneasr left a comment

Choose a reason for hiding this comment

aeneasr commented Jan 3, 2025

David-Wobrock commented Jan 3, 2025

David-Wobrock commented Aug 9, 2024 •

edited

Loading