module: validate that strip-types does not insert any code #54141

ChALkeR · 2024-07-31T11:47:31Z

We can do this while the mode is strip-types only

The wasm binary is not expected to make any changes to the source code other than inserting the allowed symbols / patterns
We can use this to increase trust in the code that the wasm binary outputs

It is still possible to do unexpected things under this limitation, but it makes anything funny much harder to pull off in a supply chain attack

Now does this if internal swc lib is e.g. switched to a codegen mode unexpectedly:

Existing tests should pass, this is just a safeguard
Should also add some multi-byte stripping tests
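
To illustrate the idea above, here is a minimal byte-level sketch of such a safeguard. It assumes that stripped regions may only be replaced by a space (0x20) or a semicolon (0x3b). The function name `outputOnlyBlanksSource` and this allowed set are assumptions for illustration, not the code in this PR, and the sketch does not handle the replacement patterns used for multi-byte characters.

```javascript
'use strict';

// Assumption: only these two bytes may replace stripped source bytes.
const SPACE = 0x20;
const SEMICOLON = 0x3b;

// Returns true when `code` has the same byte length as `source` and every
// byte is either unchanged or one of the allowed filler bytes.
function outputOnlyBlanksSource(source, code) {
  const sourceBuf = Buffer.from(source, 'utf8');
  const codeBuf = Buffer.from(code, 'utf8');
  if (sourceBuf.length !== codeBuf.length) return false;
  for (let i = 0; i < codeBuf.length; i++) {
    const byte = codeBuf[i];
    if (byte !== sourceBuf[i] && byte !== SPACE && byte !== SEMICOLON) {
      return false;
    }
  }
  return true;
}
```

Under these assumptions, output that only blanks out a type annotation passes, while output of a different length or with any other differing byte is rejected.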

nodejs-github-bot · 2024-07-31T11:47:36Z

Review requested:

@nodejs/loaders

marco-ippolito · 2024-07-31T12:24:45Z

I'm ok with 346469c, n2 can we do that in amaro with a flag?

mcollina

I think this is the wrong place to solve this.
This adds non-null overhead for loading every single file to defend against code that we control. Let's fix the reproducible build problem.

I'm ok if we run this check in CI either in the form of a test or an optional check.

ChALkeR · 2024-07-31T12:46:54Z

> This adds non-null overhead for loading every single file

@mcollina not "every single file", but every local typescript file under the --experimental-strip-types flag which is processed by wasm, where perf issues have not been addressed yet and are on the roadmap

This does not affect existing setups

Running this in CI only won't give sufficient guarantees

> to defend against code that we control.

We don't

No one here reviewed those 500 MiBs of deps in /amaro repo

No one even noticed that those are not in fact being used from that repo but are coming from elsewhere

aduh95

LGTM, regardless of reproducible builds it seems useful to have some guarantee that V8 will indeed execute users' code and nothing else. It only slows down files with types to strip, not all files IIUC.

nit: Any chance we can avoid the Buffer -> string -> Buffer conversion in e.g.

node/lib/internal/modules/esm/translators.js

Line 312 in 35f92d9

const code = tsParse(stringify(source));

Maybe we could supply another parameter that contains the original source in addition to the stringified version.
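
One possible reading of this suggestion, as a sketch only: decode the source once and carry both the decoded string and the original bytes, so a later byte-level check does not have to re-encode the string. The helper name `prepareSource` and its return shape are hypothetical and are not part of Node.js internals.

```javascript
'use strict';

// Hypothetical helper: accepts a string or a Buffer/Uint8Array and returns
// both representations, so later steps can pick the one they need.
function prepareSource(source) {
  if (typeof source === 'string') {
    return { text: source, bytes: Buffer.from(source, 'utf8') };
  }
  // Wrap the existing memory instead of copying it.
  const bytes = Buffer.from(source.buffer, source.byteOffset, source.byteLength);
  return { text: bytes.toString('utf8'), bytes };
}
```

A caller could then hand `text` to the transform and `bytes` to the validation step.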

ronag

I don't understand what this solves? We don't trust the strip code, so we add another layer at runtime to make sure it does what it should? Shouldn't this just be a test that runs in CI prior to release? We can download x number of repositories, run the type stripping, and test that it does what it should.

ChALkeR · 2024-07-31T13:03:35Z

@ronag a supply chain attack like the one on ESLint can hide itself and activate itself under specific conditions

Testing on CI does not help validate blackboxes against intentional attacks (only against unintentional mistakes)

Sandboxing + validating in runtime does

lib/internal/modules/helpers.js

Co-authored-by: Antoine du Hamel <duhamelantoine1995@gmail.com>

joyeecheung · 2024-07-31T13:09:42Z

Instead of doing checks at runtime, maybe we can just have a flag that spits out the type-stripped code to a directory (or just to stdout and let users redirect as they like?) That might also be useful for those who try to debug the type stripping, in case there are any bugs.

ChALkeR · 2024-07-31T13:16:48Z

As for the perf concern: this adds 3% overhead to pure transformSync (not counting file load, execution, etc -- even less with those counted)

I.e. this check is ~33x faster than transformSync even without the buffer optimization mentioned above

(and affects only files transformed from typescript under --experimental-strip-types flag by wasm)
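
The two figures above are the same measurement seen from both sides: a check that costs 3% of the transform time is roughly 1 / 0.03 ≈ 33 times faster than the transform. A minimal sketch of how such numbers could be measured follows. The helper names `msPerCall` and `speedRatio` are hypothetical, and this is not the benchmark that produced the figures above.

```javascript
'use strict';

// Average wall-clock milliseconds per call of fn(), over `iterations` runs.
function msPerCall(fn, iterations = 1000) {
  const start = performance.now();
  for (let i = 0; i < iterations; i++) fn();
  return (performance.now() - start) / iterations;
}

// Converts a relative overhead (check time / transform time) into
// "how many times faster the check is than the transform".
function speedRatio(overhead) {
  return 1 / overhead;
}
```

With two callbacks, one running the check and one running the transform, `speedRatio(msPerCall(check) / msPerCall(transform))` would give the ratio for a particular input.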

ChALkeR · 2024-07-31T13:19:51Z

@joyeecheung that seems much more complex (user options/format/output decisions etc) and doesn't solve the problem of a potentially carefully hidden exploit in the supply chain that activates only after specific conditions

We can't expect users to check all code generated at runtime

Especially given that it's immediately executed either way

ronag · 2024-07-31T13:36:12Z

We also have to consider that this might make future progress and iteration on this feature more difficult (possibly turning away contributors). For example moving beyond just type stripping.

ronag · 2024-07-31T13:37:19Z

How do Bun and Deno solve this?

ChALkeR · 2024-07-31T13:37:38Z

@ronag landing this would fix the immediate concern, it doesn't mean that it can't be removed once we figure out {build chain / what gets into the bundle / how do we review it} process

Which should be done by the time something other than type stripping is added

aduh95 · 2024-07-31T13:46:29Z

lib/internal/modules/helpers.js

+  const sourceBuf = BufferFrom(source);
+  const codeBuf = BufferFrom(code);
+  if (sourceBuf.length !== codeBuf.length) { return false; }


btw, have you considered doing a comparison of the JS strings? It seems to me it would be much simpler (and maybe more performant) than comparing buffers, especially given the parameters are already strings. wdyt?

for (let i = 0; i < source.length; i++) {
  const char = StringPrototypeCodePointAt(code, i);
  if (char !== 0x3b && char !== 0x20 && char !== StringPrototypeCodePointAt(source, i)) {
    return false;
  }
}
return true;

ChALkeR

@aduh95 will recheck!

I didn't use that because the original impl which this rechecks is byte-level based
But it can be done, yes

> and maybe more performant

perf results are inconclusive on this, even after an initial fast-path optimization: the string version is faster for multi-byte sources but slower for regular ASCII sources

will try another version shortly

For context, the code I used was:

function doesTSStripTypesResultMatchSource2(code, source) {
  if (code.length !== source.length) { return false; }
  for (let i = 0; i < code.length; i++) {
    const a = code.codePointAt(i);
    if (a === source.codePointAt(i)) continue;
    if (a !== 0x20 && a !== 0x3b && a !== 0xa0 && a !== 0x2002 && a !== 0xfeff) return false;
  }
  return true;
}

The set of chars is different

For now it also lacks a check that 0xfeff can only come after a space

I think we can land Buffer-based impl and optimize it later if needed (hopefully not)
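
As a sketch of what the missing check could look like in the string-based variant: the version below accepts 0xfeff only when the preceding character of the output is a space. Reading "after a space" as "immediately preceded by 0x20 in the output" is an assumption, and the function name `stripResultMatchesSource` is hypothetical.

```javascript
'use strict';

// String-based comparison with the extra 0xfeff rule.
// Assumption: 0xfeff is valid filler only directly after a space in `code`.
function stripResultMatchesSource(code, source) {
  if (code.length !== source.length) return false;
  for (let i = 0; i < code.length; i++) {
    const a = code.codePointAt(i);
    if (a === source.codePointAt(i)) continue;
    // Filler characters that are allowed anywhere.
    if (a === 0x20 || a === 0x3b || a === 0xa0 || a === 0x2002) continue;
    // 0xfeff is allowed only when the previous output character is a space.
    if (a === 0xfeff && i > 0 && code.codePointAt(i - 1) === 0x20) continue;
    return false;
  }
  return true;
}
```

Under that reading, a space followed by 0xfeff in place of a surrogate pair passes, while a 0xfeff that follows any other character is rejected.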

mcollina

lgtm

I think we can remove this at a later step.

codecov · 2024-07-31T14:26:58Z

Codecov Report

Attention: Patch coverage is 44.00000% with 28 lines in your changes missing coverage. Please review.

Project coverage is 87.06%. Comparing base (b4fd1fd) to head (1e8101d).
Report is 4 commits behind head on main.

Files	Patch %	Lines
lib/internal/modules/helpers.js	44.00%	28 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main   #54141   +/-   ##
=======================================
  Coverage   87.06%   87.06%           
=======================================
  Files         643      643           
  Lines      181576   181625   +49     
  Branches    34894    34895    +1     
=======================================
+ Hits       158088   158140   +52     
- Misses      16759    16777   +18     
+ Partials     6729     6708   -21

Files	Coverage Δ
lib/internal/modules/helpers.js	`90.95% <44.00%> (-6.76%)`	⬇️

... and 33 files with indirect coverage changes

marco-ippolito

As discussed in the TSC meeting, I believe this check belongs inside amaro, behind a configuration flag that should be enabled by node

nodejs-github-bot · 2024-07-31T14:36:33Z

CI: https://ci.nodejs.org/job/node-test-pull-request/60771/

ChALkeR · 2024-07-31T14:44:08Z

Yes, that seems to solve the concerns as well!
Will file a PR shortly today

ChALkeR · 2024-07-31T15:57:40Z

One downside of moving this into amaro directly is that it's no longer possible to use safe methods that were cached before user code was loaded

Which means that there will be an easy way for the users to circumvent this check when it's moved into amaro itself

... which seems to be an acceptable risk / out of scope
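
To make the trade-off concrete, here is a small sketch of the difference between a method reference cached before user code runs (Node.js core calls such cached references primordials) and a method looked up at call time. The function names below are illustrative and are not taken from Node.js or amaro.

```javascript
'use strict';

// Captured once, at startup, before any user code has a chance to run.
const uncurryThis = (fn) => Function.prototype.call.bind(fn);
const StringPrototypeCodePointAt = uncurryThis(String.prototype.codePointAt);

// Uses the cached reference: unaffected by later prototype patching.
function firstCodePointCached(str) {
  return StringPrototypeCodePointAt(str, 0);
}

// Looks the method up at call time: sees whatever user code installed.
function firstCodePointLive(str) {
  return str.codePointAt(0);
}

// Simulates user code tampering with the prototype, then restores it.
function demoTampering(str) {
  const original = String.prototype.codePointAt;
  String.prototype.codePointAt = function() { return 0x20; };
  try {
    return { cached: firstCodePointCached(str), live: firstCodePointLive(str) };
  } finally {
    String.prototype.codePointAt = original;
  }
}
```

A check built on call-time lookups can be made to see only "allowed" characters by code that patches the prototype first, which is the circumvention described above.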

RafaelGSS · 2024-07-31T18:02:21Z

> Which means that there will be an easy way for the users to circumvent this check when it's moved into amaro itself
> ... which seems to be an acceptable risk / out of scope

Possibly, yes. But, according to our threat model, this is ok.

ChALkeR · 2024-07-31T18:18:21Z

Yeah, just mentioning for visibility/transparency
Moving into the lib is still ok

nodejs-github-bot added the needs-ci PRs that need a full CI run. label Jul 31, 2024

ChALkeR force-pushed the chalker/swc-safeguard branch 5 times, most recently from 31545ec to 46fd9dd Compare July 31, 2024 11:58

module: set swc transform mode explicitly to 'strip-only'

346469c

ChALkeR force-pushed the chalker/swc-safeguard branch from 46fd9dd to 08ecab7 Compare July 31, 2024 12:09

ChALkeR marked this pull request as ready for review July 31, 2024 12:15

ChALkeR force-pushed the chalker/swc-safeguard branch 2 times, most recently from 8e3566e to 4ab7ef5 Compare July 31, 2024 12:22

marco-ippolito added the strip-types Issues or PRs related to strip-types support label Jul 31, 2024

ChALkeR force-pushed the chalker/swc-safeguard branch 3 times, most recently from 294585f to 239e6bb Compare July 31, 2024 12:30

mcollina requested changes Jul 31, 2024

ChALkeR force-pushed the chalker/swc-safeguard branch from 239e6bb to fc9fa66 Compare July 31, 2024 12:42

module: validate that strip-types does not insert any code

ae27328

ChALkeR force-pushed the chalker/swc-safeguard branch from fc9fa66 to ae27328 Compare July 31, 2024 12:43

aduh95 approved these changes Jul 31, 2024

ronag reviewed Jul 31, 2024

aduh95 reviewed Jul 31, 2024

Update lib/internal/modules/helpers.js

1e8101d

Co-authored-by: Antoine du Hamel <duhamelantoine1995@gmail.com>

ChALkeR mentioned this pull request Jul 31, 2024

safeguarding code changes done by wasm binary nodejs/amaro#17

Open

aduh95 reviewed Jul 31, 2024

mcollina approved these changes Jul 31, 2024

mcollina added the request-ci Add this label to start a Jenkins CI on a PR. label Jul 31, 2024

marco-ippolito requested changes Jul 31, 2024

github-actions bot removed the request-ci Add this label to start a Jenkins CI on a PR. label Jul 31, 2024
