Introduce TRAP caching #1172

edoardopirovano · 2022-08-05T15:32:23Z

This PR introduces everything that needs to happen on the Action's side for TRAP caching. In particular:

We run codeql resolve languages --format=betterjson to see the extractor options.
If we see an extractor that supports TRAP caching then we:
- If we're running on the default branch: Start from a fresh cache, tell the extractor to write up to 1GB to the cache (tbc: is this a sensible amount for size of the Actions cache and the disk space of the default runners?). Upload it near the end of the run (after we've sent results to Code Scanning and uploaded the DB to MRVA, since both of those are higher priority). (sample run with some now-removed debugging output)
- If we're running on a PR branch: Download the cache for the base commit if we have one, or any other recent commit failing that at the init step. Then we use that cache in read-only mode during extraction. (sample run with some now-removed debugging output)

All of this is gated behind a feature flag, although the trap-caching parameter to the init step can be used to force this to be on or off irrespective of the feature flag. I expect a few users will want to disable this if they're already using their Actions cache for something else so we're not competing for space.

We should document this new feature, and the option to disable it, before we turn this on for any external users. But for now we're only going to turn on the feature flag for some internal repositories, so I think it is fine to not have it in the public facing documentation just yet.

Before we enabled this on more than one or two internal repos (which we can monitor by hand), I would also like to implement some telemetry so we can have some visibility over whether this is causing any issues. Again, I propose leaving this for a later PR.

Merge / deployment checklist

Confirm this change is backwards compatible with existing workflows.
Confirm the readme has been updated if necessary.
Confirm the changelog has been updated if necessary.

src/trap-caching.ts

henrymercer

Tell the extractor to write up to 1GB to the cache (tbc: is this a sensible amount for size of the Actions cache and the disk space of the default runners?).

1 GB per language sounds reasonable to me as a default. I can think of cases where we might want to use more or less space however — perhaps we should make the TRAP cache size configurable? Question: Should the total size or the size per language be configurable?

We should document this new feature, and the option to disable it, before we turn this on for any external users. But for now we're only going to turn on the feature flag for some internal repositories, so I think it is fine to not have it in the public facing documentation just yet.

Sounds reasonable.

Before we enabled this on more than one or two internal repos (which we can monitor by hand), I would also like to implement some telemetry so we can have some visibility over whether this is causing any issues. Again, I propose leaving this for a later PR.

Also sounds reasonable 👍

src/config-utils.ts

src/config-utils.test.ts

src/config-utils.ts

src/init-action.ts

src/trap-caching.test.ts

src/trap-caching.ts

edoardopirovano · 2022-08-09T17:54:58Z

perhaps we should make the TRAP cache size configurable? Question: Should the total size or the size per language be configurable?

Indeed, we may eventually want configuration options to fine-tune this. I don't think adding this should block getting this prototype out of the door, though. In the telemetry I will be adding, I will include a field that records the size of the cache so we can use that to get a feel for what the default should be and how configurable we need this to be before we properly declare this GA.

asottile-sentry · 2023-10-17T16:28:17Z

are there docs explaining what this is or why one would want to turn it on or off? I didn't see it mentioned in the docs linked from the README and this is chewing up our entire GHA cache since it seems to put many MB per commit

adityasharad · 2023-10-17T16:57:24Z

@asottile-sentry I'm sorry to hear this feature has caused trouble for you. The changelog note here has some explanation of this feature and how to turn it off. The intended purpose is to speed up the "extraction" phase of CodeQL analysis (which processes your source code into a local database format so that it can be analysed), however this is completely optional and you are safe to turn it off in the workflow since it's using up your Actions cache quota. We have only rolled this out in a limited fashion for one language at the moment, so it is not in the public docs beyond the changelog.

To help us understand and investigate the problem better, would you able to point us to your repo, and let us know if you're using the Actions cache for any other purposes?

asottile-sentry · 2023-10-17T17:32:55Z

https://github.com/getsentry/sentry/actions/caches -- yes we use the cache for other things (I suspect this to be the common case -- so perhaps enabling this by default in codeql is the wrong default?)

sayhiben · 2023-12-12T20:47:58Z

In case the additional context helps, trap caching is also causing my organization's monorepo to exhaust our cache very rapidly. I had to turn off the feature, even though I want to use it. We use the GHA cache for other purposes as well and can't afford to let this churn our cache. I've opened another issue to hopefully move the cache storage location: #2030

edoardopirovano requested a review from a team as a code owner August 5, 2022 15:32

github-advanced-security bot found potential problems Aug 5, 2022

View reviewed changes

src/trap-caching.ts Fixed Show resolved Hide resolved

edoardopirovano mentioned this pull request Aug 5, 2022

JS: Change how TRAP cache is configured github/codeql#9949

Merged

Introduce TRAP caching

8f867dc

edoardopirovano force-pushed the edoardo/trap-caching branch from df5d008 to 8f867dc Compare August 5, 2022 16:48

github-advanced-security bot found potential problems Aug 5, 2022

View reviewed changes

src/trap-caching.ts Fixed Show resolved Hide resolved

henrymercer reviewed Aug 9, 2022

View reviewed changes

Address review comments from @henrymercer

6df9361

github-advanced-security bot found potential problems Aug 9, 2022

View reviewed changes

src/trap-caching.ts Show resolved Hide resolved

henrymercer approved these changes Aug 9, 2022

View reviewed changes

edoardopirovano merged commit 07720c7 into main Aug 9, 2022

edoardopirovano deleted the edoardo/trap-caching branch August 9, 2022 18:18

This was referenced Aug 17, 2022

Merge main into releases/v2 #1189

Closed

Merge main into releases/v2 #1192

Merged

Merge releases/v2 into releases/v1 #1195

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce TRAP caching #1172

Introduce TRAP caching #1172

edoardopirovano commented Aug 5, 2022 •

edited

Loading

henrymercer left a comment

edoardopirovano commented Aug 9, 2022

asottile-sentry commented Oct 17, 2023

adityasharad commented Oct 17, 2023

asottile-sentry commented Oct 17, 2023

sayhiben commented Dec 12, 2023

Introduce TRAP caching #1172

Introduce TRAP caching #1172

Conversation

edoardopirovano commented Aug 5, 2022 • edited Loading

Merge / deployment checklist

henrymercer left a comment

Choose a reason for hiding this comment

edoardopirovano commented Aug 9, 2022

asottile-sentry commented Oct 17, 2023

adityasharad commented Oct 17, 2023

asottile-sentry commented Oct 17, 2023

sayhiben commented Dec 12, 2023

edoardopirovano commented Aug 5, 2022 •

edited

Loading