Switch cache to custom without write-lock for reads. #5576

absoludity · 2022-10-27T02:09:18Z

Signed-off-by: Michael Nelson <minelson@vmware.com>

Follows on from #5518, this time replacing the cached package with a custom credential cache.

Description of the change

After further digging, I found that one cause of the slow handling of 50 concurrent requests going through the pinniped-proxy was that:

1. We were caching a function with an async/await signature, which means that even the cached version must have that signature as well, which in turn means a blocking i/o call (which switches the task), and
2. The Cached trait specifies that even a cache_get operation mutates the cache (in our case, just for statistics of hits/misses), which, as a result, requires acquiring a write lock on the cache to read a cached value.

For more details, please see the discussion with the Cached author.
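
To illustrate the second point, here is a minimal sketch of why a getter that records statistics forces a write lock. It is not the Cached crate's code: `StatsCache`, `lookup_with_stats` and `lookup_read_only` are hypothetical names, and `std::sync::RwLock` is assumed here purely to keep the example self-contained.

```rust
use std::collections::HashMap;
use std::sync::RwLock;

/// Hypothetical stand-in for a cache whose getter also records statistics.
#[derive(Default)]
struct StatsCache {
    entries: HashMap<String, String>,
    hits: u64,
    misses: u64,
}

impl StatsCache {
    /// Needs `&mut self` only because it updates the hit/miss counters.
    fn get(&mut self, key: &str) -> Option<String> {
        let found = self.entries.get(key).cloned();
        if found.is_some() {
            self.hits += 1;
        } else {
            self.misses += 1;
        }
        found
    }
}

/// A `&mut self` getter can only be reached through the exclusive write
/// guard, so concurrent lookups are serialized.
fn lookup_with_stats(cache: &RwLock<StatsCache>, key: &str) -> Option<String> {
    // `unwrap` panics if the lock was poisoned; acceptable for a sketch.
    let mut guard = cache.write().unwrap();
    guard.get(key)
}

/// A `&self` getter works through the shared read guard instead, which
/// many readers can hold at the same time.
fn lookup_read_only(cache: &RwLock<HashMap<String, String>>, key: &str) -> Option<String> {
    let guard = cache.read().unwrap();
    guard.get(key).cloned()
}
```

Calling `get` through `cache.read()` in `lookup_with_stats` would not compile, because a read guard only hands out a shared reference.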

To avoid both of those issues, this PR:

1. Adds a cache module that provides a generic read/write LockableCache (for multiple readers, single writer) and builds on that with a PruningCache that will prune entries (given a test function) when they should no longer be cached,
2. Uses (1) to create a single CredentialCache on startup (in main.rs) specifically for caching TokenCredentialRequest objects and pruning expired entries, and then passes this through for use in different threads concurrently.
3. Uses the cache to fetch the credentials.
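
The layering described above could look roughly like the following. This is a sketch under assumptions, not the code in cache.rs: only the `LockableCache` and `PruningCache` names come from this PR, while the method names, the `should_prune` field, the `Credential` type and `is_expired` are hypothetical, and `std::sync::RwLock` stands in for whichever lock the real module uses.

```rust
use std::collections::HashMap;
use std::hash::Hash;
use std::sync::RwLock;

/// Read/write locked map: many readers or one writer at a time.
struct LockableCache<K, V>(RwLock<HashMap<K, V>>);

impl<K: Eq + Hash, V: Clone> LockableCache<K, V> {
    fn new() -> Self {
        LockableCache(RwLock::new(HashMap::new()))
    }

    /// Only takes the read lock.
    fn get(&self, key: &K) -> Option<V> {
        let guard = self.0.read().unwrap();
        guard.get(key).cloned()
    }

    fn insert(&self, key: K, value: V) -> Option<V> {
        let mut guard = self.0.write().unwrap();
        guard.insert(key, value)
    }

    fn remove(&self, key: &K) -> Option<V> {
        let mut guard = self.0.write().unwrap();
        guard.remove(key)
    }
}

/// Wraps a LockableCache and drops entries that fail a caller-supplied test.
struct PruningCache<K, V> {
    inner: LockableCache<K, V>,
    should_prune: fn(&V) -> bool,
}

impl<K: Eq + Hash, V: Clone> PruningCache<K, V> {
    fn new(should_prune: fn(&V) -> bool) -> Self {
        PruningCache { inner: LockableCache::new(), should_prune }
    }

    /// Takes the write lock only when an entry has to be pruned.
    fn get(&self, key: &K) -> Option<V> {
        let value = self.inner.get(key)?;
        if (self.should_prune)(&value) {
            self.inner.remove(key);
            None
        } else {
            Some(value)
        }
    }

    fn insert(&self, key: K, value: V) -> Option<V> {
        self.inner.insert(key, value)
    }
}

/// Hypothetical cached value; the real cache stores credential objects.
#[derive(Clone, Debug, PartialEq)]
struct Credential {
    token: String,
    expired: bool,
}

fn is_expired(credential: &Credential) -> bool {
    credential.expired
}
```

Note that in this sketch the lookup and the prune are two separate lock acquisitions, so another thread could replace the entry in between; a real implementation may want to re-check under the write lock.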

Benefits

Fetching from the cache is now non-blocking (generally, except when an entry is being added) and so leads to less task switching, improving the total query time by ~2s (down to 3-4s).
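
As a rough illustration of the "multiple readers, single writer" behaviour behind this, the sketch below shares one cache between threads and probes the lock while a read guard is held. The names (`SharedCache`, `new_shared_cache`, `concurrent_hits`, `probe_while_reading`) are hypothetical, and it assumes `std::sync::RwLock` and OS threads rather than the proxy's actual runtime.

```rust
use std::collections::HashMap;
use std::sync::{Arc, RwLock};
use std::thread;

/// Hypothetical alias: one cache created at startup and shared by reference count.
type SharedCache = Arc<RwLock<HashMap<String, String>>>;

fn new_shared_cache() -> SharedCache {
    let mut entries = HashMap::new();
    entries.insert("token".to_string(), "credential".to_string());
    Arc::new(RwLock::new(entries))
}

/// Spawns `readers` threads that each look up the same key under a read lock
/// and returns how many of them found it.
fn concurrent_hits(cache: &SharedCache, readers: usize) -> usize {
    let handles: Vec<_> = (0..readers)
        .map(|_| {
            let cache = Arc::clone(cache);
            thread::spawn(move || {
                let guard = cache.read().unwrap();
                guard.contains_key("token")
            })
        })
        .collect();
    handles
        .into_iter()
        .map(|handle| handle.join().unwrap())
        .filter(|hit| *hit)
        .count()
}

/// While one read guard is held, reports whether a second reader and a
/// writer can acquire the lock without waiting.
fn probe_while_reading(cache: &SharedCache) -> (bool, bool) {
    let _reader = cache.read().unwrap();
    let second_reader = cache.try_read().is_ok();
    let writer = cache.try_write().is_ok();
    (second_reader, writer)
}
```

With no writer waiting, a second reader is admitted while a writer is not, which is why lookups only stall while an entry is being added.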

There is still something else using significant CPU when creating the client itself (cert-related), which I'm investigating now in a separate PR.

Possible drawbacks

Applicable issues

Ref Improve response times in Kubeapps APIs when using Pinniped-proxy #5407

Additional information

Example log when using RUST_LOG=info,pinniped_proxy::pinniped=debug which shows the cache being used after the first request. I've not included it in the output generally, but the cache get is now always under a millisecond. As above, the significant delays (some calls to prepare_and_call_pinniped_exchange only 4ms, others 98ms) are what I'll look at next.

2022-10-27T01:42:47.820245 [INFO] - Listening on http://0.0.0.0:3333
2022-10-27T01:43:05.077116 [DEBUG] - prepare_and_call_pinniped_exchange took 17ms. Used cache?: false
2022-10-27T01:43:05.085273 [INFO] - GET https://kubernetes.default/api?timeout=32s 200 OK
2022-10-27T01:43:05.091663 [DEBUG] - prepare_and_call_pinniped_exchange took 5ms. Used cache?: true
2022-10-27T01:43:05.100437 [INFO] - GET https://kubernetes.default/apis?timeout=32s 200 OK
2022-10-27T01:43:05.106005 [DEBUG] - prepare_and_call_pinniped_exchange took 4ms. Used cache?: true
2022-10-27T01:43:05.209952 [DEBUG] - prepare_and_call_pinniped_exchange took 21ms. Used cache?: true
2022-10-27T01:43:05.299424 [DEBUG] - prepare_and_call_pinniped_exchange took 5ms. Used cache?: true
2022-10-27T01:43:05.311599 [DEBUG] - prepare_and_call_pinniped_exchange took 5ms. Used cache?: true
2022-10-27T01:43:05.493269 [DEBUG] - prepare_and_call_pinniped_exchange took 98ms. Used cache?: true
2022-10-27T01:43:05.593683 [DEBUG] - prepare_and_call_pinniped_exchange took 4ms. Used cache?: true
2022-10-27T01:43:05.604348 [DEBUG] - prepare_and_call_pinniped_exchange took 4ms. Used cache?: true
2022-10-27T01:43:05.697828 [DEBUG] - prepare_and_call_pinniped_exchange took 87ms. Used cache?: true
2022-10-27T01:43:05.811590 [DEBUG] - prepare_and_call_pinniped_exchange took 20ms. Used cache?: true
2022-10-27T01:43:06.004358 [DEBUG] - prepare_and_call_pinniped_exchange took 94ms. Used cache?: true
2022-10-27T01:43:06.098603 [DEBUG] - prepare_and_call_pinniped_exchange took 5ms. Used cache?: true
2022-10-27T01:43:06.108756 [DEBUG] - prepare_and_call_pinniped_exchange took 4ms. Used cache?: true

netlify · 2022-10-27T02:09:44Z

✅ Deploy Preview for kubeapps-dev ready!

Name	Link
🔨 Latest commit	`c848f59`
🔍 Latest deploy log	https://app.netlify.com/sites/kubeapps-dev/deploys/63775589d3e4d50009462191
😎 Deploy Preview	https://deploy-preview-5576--kubeapps-dev.netlify.app

absoludity · 2022-10-27T02:15:09Z

cmd/pinniped-proxy/src/cache.rs

+/// Importantly, checking the cache does not require a write-lock
+/// (unlike the [`Cached` trait's `cache_get`](https://github.com/jaemk/cached/blob/f5911dc3fbc03e1db9f87192eb854fac2ee6ac98/src/lib.rs#L203))
+#[derive(Default)]
+struct LockableCache<K, V>(RwLock<HashMap<K, V>>);


This struct uses the newtype pattern, a zero-cost abstraction (i.e. no penalty at run-time) that defines a new type as a thin wrapper, a 1-tuple around another type, so that we can add our caching functions (get, insert) to this type without needing an extra struct. So in this case, a LockableCache is really just a read-write lock wrapping a hash map, but one which behaves like a cache.
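
A small sketch of what the newtype buys us, assuming `std::sync::RwLock` and hypothetical method bodies (only the struct definition is taken from the diff above). The size check reflects what rustc does in practice for a single-field wrapper; the layout is only formally guaranteed with `#[repr(transparent)]`.

```rust
use std::collections::HashMap;
use std::hash::Hash;
use std::mem::size_of;
use std::sync::RwLock;

/// The wrapper from the diff: a 1-tuple around the locked map.
#[derive(Default)]
struct LockableCache<K, V>(RwLock<HashMap<K, V>>);

/// Cache-like methods live on the new type; `self.0` is the wrapped lock.
impl<K: Eq + Hash, V: Clone> LockableCache<K, V> {
    fn get(&self, key: &K) -> Option<V> {
        let guard = self.0.read().unwrap();
        guard.get(key).cloned()
    }

    fn insert(&self, key: K, value: V) -> Option<V> {
        let mut guard = self.0.write().unwrap();
        guard.insert(key, value)
    }
}

/// Compares the wrapper's size with the size of the type it wraps.
fn wrapper_adds_no_size() -> bool {
    size_of::<LockableCache<String, String>>() == size_of::<RwLock<HashMap<String, String>>>()
}
```

Callers only see `get` and `insert`, so the lock and the map stay an implementation detail of the type.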

absoludity · 2022-10-27T02:22:51Z

cmd/pinniped-proxy/src/main.rs

+                record.args()
+            )
+        })
+        .init();


Whoops - I'd left the .init() off when I re-enabled logging in my last PR and didn't check it (so nothing would be logged without this change).

antgamdia

Thanks for digging into the root cause; it's a pity we had to drop the cached crate and implement it on our own. I didn't expect a read operation to be blocking :S

absoludity · 2022-11-07T00:35:27Z

> Thanks for digging into the root cause; it's a pity we had to drop the cached crate and implement it on our own. I didn't expect a read operation to be blocking :S

Yeah - neither did I, though I did afterwards remember that the Cached trait keeps statistics of misses etc.

Signed-off-by: Michael Nelson <minelson@vmware.com>

vmwclabot added the cla-not-required label Oct 27, 2022

absoludity force-pushed the new-pinniped-cache-2 branch from c2149e7 to 733673b Compare November 2, 2022 19:42

antgamdia approved these changes Nov 4, 2022

absoludity force-pushed the new-pinniped-cache-2 branch from 733673b to beec648 Compare November 17, 2022 04:43

absoludity added 2 commits November 18, 2022 20:49

Switch cache to custom without write-lock for reads.

da7e1eb

Signed-off-by: Michael Nelson <minelson@vmware.com>

Update cargo.lock after dep removal.

c848f59

Signed-off-by: Michael Nelson <minelson@vmware.com>

absoludity force-pushed the new-pinniped-cache-2 branch from beec648 to c848f59 Compare November 18, 2022 09:51

absoludity merged commit a9d3601 into main Nov 18, 2022

absoludity deleted the new-pinniped-cache-2 branch November 18, 2022 10:31
