Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: Clarification around real world risks and use cases of VM module #40718

Closed
1 task
ghost opened this issue Nov 4, 2021 · 39 comments
Closed
1 task

docs: Clarification around real world risks and use cases of VM module #40718

ghost opened this issue Nov 4, 2021 · 39 comments
Labels
question Issues that look for answers. vm Issues and PRs related to the vm subsystem.

Comments

@ghost
Copy link

ghost commented Nov 4, 2021

📗 API Reference Docs Problem

  • Version: n/a

  • Platform: n/a

  • Subsystem: VM

Affected URL(s):

Description

Apologies if this isn't the appropriate place for this; I wasn't sure if it fell under help versus docs versus somewhere else.

The official docs for the VM module state the following in the first line of it's description.

The vm module is not a security mechanism. Do not use it to run untrusted code.

The last line of the Example: Running an HTTP server within a VM section says:

This may introduce risks when untrusted code is executed, e.g. altering objects in the context in unwanted ways.

Now, I think it's fairly safe to say it's quite clear that blindly running arbitrary code through the vm module (or anything else) is most definitely not secure or safe. Although the second quote is a bit less definitive and things like the contextCodeGeneration option in runInNewContext are almost a bit of a teaser that might lead people to believe in some cases it is okay, but ultimately I digress :)

This issue is two-fold:

First, I'm curious if it's possible to have additional context (no pun intended) about when the vm module is useful in practice to the average developer. I think it's safe to say a lot of people misunderstand it, and in some cases that can have very bad security consequences.

Second, I'm curious if it's possible to elaborate on situations where using the vm module in restricted/well-defined situations it could be safe to run code under certain conditions. The following is a contrived example and one could argue it's not even an example of untrusted code, but I'm hoping it'll at least illustrate the idea.

Say for example you take an expression and parse it into an AST and validate that it only includes certain nodes. For sake of argument, let's say you only accept identifiers, numeric literals, and operators like <, >, ==, and &&. Now take the following example:

const vm = require('vm');

const code = 'apple == 5 && blueberry > 10';
const ctx = Object.create(null);

ctx.apple = 55;
ctx.blueberry = 33;

const myBool = vm.runInNewContext(code, ctx, { contextCodeGeneration: { strings: false } });

Now, you could certainly argue if you're parsing a string into an AST, validating the nodes, etc. that you aren't really running untrusted code at that point, but bear with me. Is this even a valid use case for the module? Is this even safe assuming you can validate the AST and the context you're passing in? Does this make you want to hit your head against your desk? :P

Ultimately, there are obviously other ways to do this, you're already parsing it out, you could just make your own logic to piece it back together without ever running the actual code itself, but this seems like a simple way to actually achieve evaluation of simple JS expressions, among other potential use cases, etc.

I think the vm module is one of the most misunderstood modules out there, because a lot of developers I've talked with either completely misunderstand what it does, how you'd use it, or why you'd even use it, along with the potential for horrible security consequences when used inappropriately. It'd be awesome if the docs could include some more information on the practical side of things and where the module really shines in real world usage. Telling people when not to use something is certainly value added, but also illustrating real world use cases for when you would want to use it can be equally valuable.


  • I would like to work on this issue and
    submit a pull request.
@ghost ghost added the doc Issues and PRs related to the documentations. label Nov 4, 2021
@ghost ghost changed the title Clarification around real world risks and use cases of VM module docs: Clarification around real world risks and use cases of VM module Nov 4, 2021
@Mesteery Mesteery added question Issues that look for answers. vm Issues and PRs related to the vm subsystem. and removed doc Issues and PRs related to the documentations. labels Nov 4, 2021
@devsnek
Copy link
Member

devsnek commented Nov 4, 2021

i think the primary use case of vm is tooling. projects like jest, for example, make extensive use of it. however as soon as you start getting into running code that you haven't vetted, you must take 100% of the burden of validation onto yourself. the amount of footguns in the vm module here is immense. For example the option you mentioned, contextCodeGeneration, does not in fact prevent all forms of eval, it just disables the forms of eval that are created by the new context. If you combine this option with a membrane (basically layers of proxies between the inner and outer context) and provide an override for import() and have a timeout and set the microtask queue option correctly and etc etc it does actually provide the ability to run untrusted code, but again that's on the person using it to do very carefully.

maybe something we could put in the documentation there is a link to https://github.com/laverdet/isolated-vm, which does actually provide a way to run untrusted code with the guarantee that it cannot escape.

@ghost
Copy link
Author

ghost commented Nov 4, 2021

I think you summed up the footgun aspect of it much better than I ever could :P The first thing the average person thinks when they see the vm module is that it's some type of sandbox for arbitrary code. Seeing as that's precisely what it's not (except for very explicitly and carefully crafted cases), the next question I generally hear is "well, then what's the point?".

I think adding a little context (again, no pun intended) even as simple as the bit about tooling or testing frameworks would be super useful to help people understand/digest what it could be used for. Linking to something like isolated-vm could be extremely worthwhile as well. Instead of just telling people not to do something, telling them, but wait, here's an actual safe way you can do <insert_random_usually_bad_idea_here> I think could go a long way.

I realize my initial post might be a little long-winded, but I think there's a solid opportunity here for clarifying while also pointing people in constructive directions. I'd be more than happy to help write something up, but at the same time, I'll admit I definitely wouldn't put myself in the boat that fully understands all the caveats here, so having someone that does being able to ELI5 to a broader audience is definitely needed.

@mcollina
Copy link
Member

Here is an odd idea: maybe we should embed isolated-vm (or a very similar approach) into Node.js itself.

cc @bmeck

@cristianstaicu
Copy link

If you combine this option with a membrane (basically layers of proxies between the inner and outer context) and provide an override for import() and have a timeout and set the microtask queue option correctly and etc etc it does actually provide the ability to run untrusted code, but again that's on the person using it to do very carefully.

As discussed in the Hackerone report #1398777, I disagree with this statement. I think building a secure sandbox on top of this module is close to impossible because of its unpredictable/inconsistent behavior. IMHO, the statement "Do not use it to run untrusted code" should be emphasized further "Do not use it to run untrusted code or for building systems that run untrusted code". It should also be colored in red and underlined if possible. 😁

However, I agree that including a pointer to something like isolated-vm or Secure EcmaScript, or even vendoring them in would be a good idea.

@mcollina
Copy link
Member

cc @laverdet

@mcollina mcollina added the tsc-agenda Issues and PRs to discuss during the meetings of the TSC. label Nov 25, 2021
@Trott
Copy link
Member

Trott commented Dec 2, 2021

@mcollina We weren't sure why this was on the TSC agenda. (Awareness? Decision? Something else?) We pushed it to next week's agenda, but if you want to leave a sentence or two explaining what the ask of the TSC is here, that would be great.

@mcollina
Copy link
Member

mcollina commented Dec 3, 2021

There are two fundamental questions:

  1. Do we want to mark the vm module with a "Do not use it to run untrusted code or for building systems that run untrusted code" statement? There are plenty of modules in the ecosystem that claims this.

  2. Should we work to embed isolated-vm? Is that the solution we want?

Note this ties with some of the discussion about primordials.

@laverdet
Copy link
Contributor

laverdet commented Dec 3, 2021

Hi I'm the isolated-vm guy. I personally don't think the node team should attempt to implement and maintain secure environments in core node. Additionally, I think blessing isolated-vm via a link in the core documentation could be irresponsible. isolated-vm is a great tool but still needs to be utilized by seasoned experts to avoid numerous foot guns. I'm happy to jump on the TSC call to provide more color, but I'm also getting the sense that there's not actually a strong push here to get this kind of thing embedded.

@ghost
Copy link
Author

ghost commented Dec 4, 2021

I'll preface this by saying I don't have a strong opinion on whether something is added to node core itself, or whether something like isolated-vm is vendored in, but that's largely orthogonal to the purpose of why I created this issue.

I don't believe anyone is saying there aren't footguns or bad outcomes to doing this incorrectly, but I want to call out that's already happening right now; we're already living in the worst case scenario. There's plenty of use cases where people are already using the vm module in horribly insecure ways, and doubling down on making the docs less useful by adding scarier warnings isn't constructive to me.

Adding a link to the docs isn't giving a blessing to blindly use isolated-vm or any other linked module; it's about trying to give people a constructive path forward where they might be less likely to shoot themselves in the foot. The use case to run untrusted code isn't new and it's not going away, and instead of treating it like Voldemort, I'd love if we embrace that head on and try to find better ways to educate people while enabling them to do things using best practices.

I also strongly believe "running untrusted code" is somewhat of a strawman, because most people actually want to run things in more constrained scenarios or with restrictions, and being able to clearly articulate that with different options, along with their risks would be extremely valuable.

Unpopular opinion, and nothing personal against the authors of a lot of these libraries (I think they're awesome), I absolutely hate the security warnings on a lot of modules, including things like isolated-vm. Not because they aren't valid, but because to most people they aren't actionable. A lot of people will either read them, feel helpless, and use it anyways, or they'll just ignore it or use an even less secure module that doesn't have a bunch of scary warnings on it. There's nothing wrong with calling out or acknowledging security considerations, but it'd be awesome if they were more accessible and actionable to your average person.

tl;dr - I strongly believe there's a large gap between blindly running untrusted code and a lot of the practical use cases people actually care about. I 100% agree we can't boil the ocean, but we can definitely boil a few cups of water or even a whole pot.

Appreciate the discussion nonetheless :)

@laverdet
Copy link
Contributor

laverdet commented Dec 4, 2021

Documenting constraints and invariants of an API is just part of the job and, in the case of isolated contexts, security is a huge part of those constraints. I can't speak for other modules but the "Security" section of isolated-vm's README, has lots of actionable items [check nodejs for v8 updates which sneak in via semver-minor updates, implement process isolation, enable untrusted code mitigations, stay up to date on local privilege escalation vulnerabilities in your kernel, research container escape attacks, be aware of advanced persistent threats]. Adding more information about security actually came at the insistence of @cristianstaicu who chimed in here earlier.

The internet is an exceptionally hostile sea and getting these things wrong ends entire companies and causes non-theoretical real-world human suffering. The more awareness that can be raised about these things the better.

@ghost
Copy link
Author

ghost commented Dec 4, 2021

I'll completely agree the security section for isolated-vm is better than most, hands down, I don't argue that, but I'm still firmly in the camp those types of sections do more harm than good (not because security sections are bad but because most implementations/executions of said security sections aren't very good). I'll take my own advice here and try to provide some concrete examples rather than speaking in generalities (this is going to be scoped to isolated-vm but would also apply to the built-in vm module or docs. I'll preface this by saying don't take these use cases or examples too literally, I'm just trying to illustrate there might be subsets of "running untrusted code" that are understood or scoped well enough that can generally be done safely using best practices.

Let's say someone wants to run untrusted code with no context. In other words, I have a string, I want to execute it in a fresh context with nothing passed into it at all and return a value. Is there a safe way to do this?

const isolate = new ivm.Isolate({ memoryLimit: 8 });
const context = await isolate.createContext();
const data = context.eval(code, { timeout: 1 })

If we ignore the possibility of trying to blow up a process's memory or running an infinite loop, is the above code "safe"? By safe, are there known vulnerabilities or exploits that can be used to escalate privileges, read env variables, etc.?

Let's take an even more strict use case. What if we use something like @babel/parser to parse a string as a valid expression and only allow string and numeric literals (again, ignore the actual utility of doing this specifically)? Is there a safe way to do this?

// parse code using babel
// check the node type of each node
// if node type is not a string or numeric literal, throw an error, else continue

const isolate = new ivm.Isolate({ memoryLimit: 8 });
const context = await isolate.createContext();
const data = context.eval(code, { timeout: 1 })

The ultimate point I'm trying to make is there is a difference between an absolutist point of view on security and a pragmatic view on security. An absolutist point of view is that nothing is technically safe, there could be unknown zero-day exploits in V8 allowing remote code execution, and perhaps the nodejs.org homepage should simply have a giant red banner on it at all times stating to not use nodejs unless you have a deep understanding of security and everything involved, but I'm assuming most people would laugh me out of the room if I suggested that (rightfully so :P).

What I'm suggesting is that node take a more pragmatic approach and walk people through a handful of real-world use cases and risks. Start with using the vm module or isolated-vm or whatever with a simple string, passing in no data. Explain the risks, explain what expectation of safety you could expect. Then walk through a slightly more complex case, say passing in simple data like strings or numbers. Then another case building on it where you pass in an object, or run a function, etc.

I'll admit, when I first created this issue it was a bit opened ended, and it's probably gotten even broader since then, so I'll try to summarize my thoughts in a sentence: the current vm docs are useless, confusing, and contain a single contrived example that doesn't even scratch the surface as to why you'd use the vm module in the first place, and I'd love for that to change.

EDIT:

The internet is an exceptionally hostile sea and getting these things wrong ends entire companies and causes non-theoretical real-world human suffering. The more awareness that can be raised about these things the better.

I completely understand this view, but the question I'd challenge you with is if slapping security warnings or basically making people feel like unless they have 50 years of experience the answer is always "no you can't do that safely" is the most effective way to mitigate that risk? Or is it possible we still explain the very real risks of doing it wrong while still providing ways to practically apply said functionality using best practices in a safe way.

@cristianstaicu
Copy link

Since I am the guy arguing for big red security banners, I'll try to defend that viewpoint below. Overall, I like the examples @michael-melodyarc gave above, and a few months back, I would have agreed that they are a valuable addition to the current Node.js docs. But they would give a false sense of security to those reading them (especially the empty context example, as I will discuss below).

Like @devsnek above, I thought that we can build fairly secure sandboxes on top of the vm module, using membranes. I thought that most of the security issues we'll see are about exchanging references between the host code and the sandbox. That is actually false. A bit of context from my side: I am coordinating a research project that aims to test and harden JavaScript sandboxes, most of them built using the vm module. At first, the tool we built produced some interesting sandbox breakouts that we could understand, and thus, we could assist developers in fixing them. But then, we saw some very strange breakouts that we could not explain. After investigating further, it turned out that thevm module is leaking host references into the sandbox, in a very unpredictable way that cannot be intercepted by the membrane. In the PoCs we have, there is no explicit exchange of data/references between the host and the sandboxed code, i.e., the sandbox has an empty context. Thus, the membrane has nothing to intercept.

We are in the process of disclosing these patterns to the maintainers of the sandboxes, so unfortunately I cannot share them here, but if you have access to the Hackerone report I mentioned above, please take a look. The problem is that the maintainers do not have many options here: they can either abandon their fairly-popular packages or put pressure on you guys to fix the problem in the Node.js core, since their membrane-based approaches cannot catch these problematic references. My report was closed as "informative" after getting the following reply: "This looks like a vulnerability on those sandboxes and not the vm module itself as it is clearly marked as not being security sandbox. We clearly do not provide any security guarantee on the behavior of vm." While I understand this attitude (the bug I reported is really spooky, probably caused by the JIT, or some weird memory issue), I do not want to be in the shoes of those maintainers that now realize they are standing on quicksand.

I know that this is a bit vague without a clear code example, and I apologize for that. I hope my viewpoint is clear, though: the docs should reflect the reply I got on Hackerone. That is: do not build anything dealing with untrusted code using this module, as there probably are some quirky corner cases, which we are not willing to support you with. This will prevent situations like the ones we are in: millions of users are exposed (according to the download stats at least) without an easy way to deploy a fix. I know this will sound like academic blah-blah, but making security assumptions clear is crucial for smooth collaboration in open-source projects.

@ghost
Copy link
Author

ghost commented Dec 5, 2021

@cristianstaicu Appreciate the insight; while I'm not privy to your specific work (it definitely sounds interesting), I understand where you're coming from and what you're wanting to protect people against.

I want to clarify a few things on my part and wrap up with hopefully some concrete actions that node core could say yes or no to.

I'm not advocating we make the vm module secure; don't get me wrong, that'd be awesome if we could, but my main ask that started all this was just better docs / use cases around the module (i.e. what is the vm module actually good for). Although I probably sound like a broken record (sorry), I'm not against security warnings, more so one-sided security warnings (e.g. having security warnings along with examples of proper usage is great, having warnings with no practical examples is not so great). For an example, @devsnek mentioned jest makes extensive use of the vm module; that'd be a useful insight to have added to the docs; it's not something I had thought about previously as a use case but makes perfect sense in hindsight.

I know you can't get into the details, so perhaps isolated-vm is included in the "standing on quicksand" comment, but whether it's the vm module, isolated-vm, or some yet to be created package, it'd be great for node to still point out some more practical examples for people who are dabbling in that area.

Maybe the answer is you literally can't do anything securely no matter how simple, which is certainly not a fun answer, but if it's accurate, it is what it is. I'm hopeful we could provide some "cookbook" type docs that would maybe walk through some use cases better, and give examples, while still calling out risks, etc. in an easier to digest way. I'm not suggesting node core makes any absolute security guarantees about any of this, or expecting @laverdet to make any guarantees about blindly using isolated-vm either, but is there some middle ground between "this isn't secure with use cases left as an exercise to the reader" and a blanket blessing to blindly use something in bad ways?

Before I get to the list of concrete actions, I'll call out one assumption I'm making, which is there are at least some ways to do certain subsets of things securely, whether that's through vm, isolated-vm, or something else. If that's wrong and there isn't a way to guarantee the safety of something even as simple as executing 1 + 1, then some of what I'm going to suggest won't make any sense, because basically everything is broken by default at that point.

  1. Update the vm module docs to articulate better the actual use cases for when and why you'd want to use the module, bonus points for including how it's currently being used in the real world (not suggesting direct links to something like jest from the docs, but calling out things in more general cases like "testing frameworks or tooling" would work). Are there other real world use cases where using the vm module would shine?

  2. If there are better options like isolated-vm, it'd be really useful to at least point to more docs or information. I'm not asking node core to bless isolated-vm, but assuming it's base/core security assumptions are sound, it would be extremely helpful guidance even if it's just a pointer or link to the package.

  3. Are there some simple examples and use cases that could be given and walked through, calling out why certain subsets of use cases might be more secure or safer than others, while calling out what types of things will cause security issues. To be fair, I believe isolated-vm's docs do a decent job at starting this conversation, but then kind of just drops off without any practical usage or examples. I mean, it's maybe a bit contrived, but even just starting with executing 1 + 1 safely or some examples that only use primitives and binary or logical operators, etc.?

Without going down a rabbit hole, I'm simply curious if there is guidance (even if high-level) that could be given to help people at least start exploring this area or doing scoped down use cases of "executing untrusted code" even if it's not in the traditional sense of blindly running a blob but more so in the realm of "check if code is valid expression, parse into AST, validate node types, only use primitives, etc."

To end, I'm more than happy to spend some time writing up some sections for the docs if everyone is open to that, but I'll also acknowledge I don't fully understand this problem space, so it's a bit hard for me to write up what some of these sections might ultimately look like. Appreciate the in-depth responses.

@ghost
Copy link
Author

ghost commented Dec 5, 2021

@cristianstaicu - throwing out one last thing here. I'm genuinely curious what your initial reaction to something like this would be, is it (1) this is relative sane/safe and should work as expected, or (2) I can't believe there are people in the world who would even consider writing code like this :P

import ivm from 'isolated-vm';
import { parseExpression } from '@babel/parser';
import { traverseFast } from '@babel/types';

const code = '...';
const someJsonBlobThatIsParsed = JSON.parse('...');

const allowedNodeTypes = [
  'BinaryExpression',
  'Identifier',
  'LogicalExpression',
  'MemberExpression',
  'NumericLiteral',
  'StringLiteral',
];

traverseFast(parseExpression(code), (node) => {
  if (!allowedNodeTypes.includes(node.type)) {
    throw new Error('Node type not allowed: ' + node.type);
  }
});

const isolate = new ivm.Isolate({ memoryLimit: 8 });
const context = await isolate.createContext();

await context.evalClosure(
  'for (const [k, v] of Object.entries($0)) globalThis[k] = v',
  [someJsonBlobThatIsParsed],
  { arguments: { copy: true } },
);

const result = await context.eval(code, { copy: true, timeout: 1 });

The goal was to allow simple JavaScript expressions, like a.b > c && d == f or template strings / Array.reduce/map, etc. There are a handful of libraries that do bits and pieces, or templating languages galore, but we were looking into avoiding the whole "learn yet another syntax" when a simple JavaScript expression would suffice instead. This would be one of the reduced or subset of use cases I was alluding to earlier about if it's possible to do things like this safely without chaos ensuing.

@cristianstaicu
Copy link

First of all, isolated-vm uses a radically different way of isolating untrusted code, which is not affected by the findings I mentioned above. This is also the reason why I am advocating for a pointer to this package in the docs, or to this ECMAScript proposal. For the sake of discussion, let us assume we are talking about another sandbox that internally relies on the vm module.

The code you shared above looks ok-ish to me (maybe a yellow flag for this statement globalThis[k] = v that can rewrite important builtins of the globalThis, e.g., the prototype). You said that nobody writes code like this, but it turns out I published a similar proof of concept four years ago on npm, but for eval in that case. The problem is that nobody wants to maintain those allow lists and as soon as you encounter some code blocked by the restrictive policy, you will start adding AST types to your list, e.g., CallExpression. Considering the quirks of the language, you do not need many relaxations of the policy to end up with the entire JS semantics. Also, if you allow CallExpressions, how do you enforce the policy on dynamically loaded code, i.e., eval/Function? Our PoC breakout has 8 lines of code and it uses 5 additional AST types than the ones you listed in your example.

So, yes, I agree with you, there may be some safe use cases if you aggressively restrict the input, but IMHO most of the real-world use cases do not look like this. Thus, it is important to know what happens in case you messed up and your first line of defense, e.g., your membrane or static analysis, is not effective. Myself, I would either opt for a frozen runtime (à la SES) or for an isolation mechanism that uses the V8 Isolate (à la isolated-vm), which passed the test of time and is effectively deployed in billions of browsers.

Most importantly, you would want to make sure you are not in the case I described above, when the building blocks of your infrastructure (the vm module in this case) are just incompatible with your use case (arbitrary untrusted code) - and this is very hard to do in the npm world. Bottom line, I agree that the documentation should describe what the module is good at (insisting on the non-security use cases), but in addition, also what it is not good at, e.g. https://github.com/hacksparrow/safe-eval/issues.

@ghost
Copy link
Author

ghost commented Dec 5, 2021

Sorry, haha, I was being a bit tongue-in-cheek with my number 2 option :P I was more poking fun at myself since I wrote that snippet. I must have also gotten mixed up somewhere along the way as I thought you were initially against making any mention of isolated-vm in the docs, but I think that might have been laverdet's comment I was thinking of.

Anyways, wholeheartedly agree with the rest of your comment. Some of my suggestions are probably aimed at isolated-vm docs more than node's docs, but that depends on how in-depth node might go into the issue. Your comment is actually a great example of what I'd love to see in the docs though, basically thinking out loud and walking through an example.

So you want to execute some simple code that only allows some literals and a few operators? Okay, we can do that.
But what if you want to allow someone to call Array.map? Just allow CallExpression node types? OOPS, you just blew up half your previous assumptions.
What if you restrict the callee identifier to map? Blah blah blah....

Security is hard, and I think the current lack of examples (good uses and bad uses) makes it difficult for most to understand the why behind the recommendations. People generally aren't great thinking in abstract terms, so while saying vm module isn't safe (totally valid and correct) might stop some, having some examples that show precisely why it's such a terrible idea for security conscience use cases would stop a lot more.

Anyways, I'm rambling now, but I think there are a ton of learning opportunities in and around this problem space (it's a cool space), some which touch node core and some which don't, that could make a real difference to people looking at the docs and messing around with things.

@ghost
Copy link
Author

ghost commented Dec 5, 2021

@devsnek @mcollina - Piggy backing on:

Should we work to embed isolated-vm? Is that the solution we want?

Is there any context (no pun intended) for why the current vm module is implemented the way it is? Would there be any downsides to using an implementation closer to isolated-vm? As it stands right now, it seems like the vm module is forever destined to be this thing that will never quite work the way the average person expects, and is limited in actual utility.

Thought exercise, if jest swapped out vm for isolated-vm, would there be significant regressions? Are there performance limitations and fundamental incompatibilities? Curious if isolated-vm is closer to the ideal state regardless, or if there are reasons why you'd intentionally pick vm over something like isolated-vm.

@mcollina
Copy link
Member

mcollina commented Dec 6, 2021

I do not have the time to contribute much to this discussion minus the viewpoint of a security triager.

I think it would be awesome for the folks involved in this discussion to collaborate on some updated text for the documentation.

If you all would also like to work on some additional code which guarantee more safety it would be awesome too.

@bmeck
Copy link
Member

bmeck commented Dec 6, 2021

I think people asking for ways to make it more secure should attend the SES/Realms TC39 calls, because I completely agree with @michael-melodyarc that doing so is not feasible or practical in a general sense. Lots of time in those calls goes into things like how storing an Object in an internal slot (private field) can absolutely break out of a membrane. These kinds of nuances are not things I think could be taught by simple documentation nor would be backwards compatible with the API design of the vm module to prevent.

@mhdawson
Copy link
Member

mhdawson commented Dec 16, 2021

From the TSC meeting today. We did not have people with the required context but this is what we discussed on the two questions to the TSC.

  • Do we want to mark the vm module with a "Do not use it to run untrusted code or for building systems that run untrusted code" statement? There are plenty of modules in the ecosystem that claims this.
  • Should we work to embed isolated-vm? Is that the solution we want?
    • There is comment in issue from maintainer of isolate-vm that says we don’t want to do that

@cristianstaicu
Copy link

shadowrealm is more or less equivalent to VM

I agree, but the devil is in the details. Since the vm module is a wrapper around V8 Isolates, in theory it could provide similar guarantees as let's say isolated-vm, shadowrealm or any other similar proposal, but it does not. As I mentioned above, we found a clear bug in Node.js/vm module and I reported it on HackerOne because it breaks the assumption that "membranes on top of vm module are enough to isolate untrusted code" in a lot of sandboxes we analyzed. I got the reply: "This looks like a vulnerability on those sandboxes and not the vm module itself as it is clearly marked as not being security sandbox. We clearly do not provide any security guarantee on the behavior of vm." I understand that the Node.js team can decide what goes into their product and what not, but this should be made clear to the community, the sandbox creators, users, etc. Once again, this discussion is about willingness to support certain use cases over others.

@targos
Copy link
Member

targos commented Jan 29, 2022

Since the vm module is a wrapper around V8 Isolates

That's not true. The vm module is a wrapper around V8 Contexts. All contexts created with it are attached to the same isolate. That's why objects can be shared between contexts and the Node.js process and that's why it doesn't provide an isolated sandbox.

@cristianstaicu
Copy link

Sorry, my mistake there. What I wanted to say is that for base isolation, all these systems rely on tried and tested building blocks from the engine (Isolate, Context). So they should provide similar guarantees as those building blocks, or? My understanding is that inside the browser, Contexts are also used to isolate mutually untrusted components, i.e., iframes, scripts of Chrome extensions, etc.. So why couldn't they be used for the same purpose in Node.js? Am I missing something? Sure, once you allow shared references between contexts a lot of subtle problems may appear, but the bug we found was in a "fresh" new context where you do not expect many such subtleties.

@ghost
Copy link
Author

ghost commented Jan 29, 2022

At the risk of coming across hyperbolic, can the docs be updated to include something like the following (ignore the exact wording, just iterating intent)?

The vm module is not a security mechanism. Do not use it to run untrusted code. If a library that relies on the vm module makes any security guarantees, they are null and void. It is not possible to build a secure sandbox or run untrusted code using the vm module in any capacity, full stop.

At this point, there are two main discussions here:

  1. What should the docs be updated to say?
  2. This could possibly be its own separate issue, but why is the vm module the way it is?

The reason I keep getting hung up on 2 is that the vm module seems to be full of holes by design while serving no constructive purpose. If something like isolated-vm can do everything the vm module can do, and then some, while also being an order of magnitude more secure by default, would node want to eventually move to simply use a model similar to isolated-vm as the default, rather than an insecure by default and impossible to actually secure current state vm module?

@cristianstaicu
Copy link

While I tend to say yes to your first proposal/question, I am not sure that that is the responsible thing to do at this point. There is a certain sandbox with five million weekly downloads, built on top of vm. Isn't it irresponsible to tell all those people that they are on their own? For 2., my bet is that the answer is performance: spawning one Isolate for each Jest test sounds like an overkill. But I guess someone from the Node.js core team should comment on this.

@ghost
Copy link
Author

ghost commented Jan 29, 2022

Isn't it irresponsible to tell all those people that they are on their own?

But they already are on their own, they just don't know it. I totally agree with you in terms of not wanting to leave people to fend for themselves, but it seems like we're already living in the worst case scenario: things aren't secure/safe AND no one wants to say it out loud (minus a one sentence disclaimer in the docs). To me, the most responsible thing to do is make it blatantly obvious what they're doing isn't safe (and even better if can give clear examples / information as to why).

@laverdet
Copy link
Contributor

laverdet commented Jan 30, 2022

This documentation is a message... and part of a system of messages... pay attention to it!

Writing this documentation was important to us. We considered ourselves to be a powerful
culture.

This module is not a thing of honor... no highly esteemed deed is commemorated here... nothing
valued is here.

What is here was dangerous and repulsive to us. This documentation is a warning about danger.

The danger is in a particular module... it increases towards a function... the center of danger
is here... of a particular definition and declaration, and in this module.

The danger is still present, in your time, as it was in ours.

The danger is to the running process, and it can yield complete control.

The form of the danger is an insecure sandbox.

The danger is unleashed only if you use this module to run untrusted software. This module is
best shunned and left unimported.

@mcollina
Copy link
Member

remove the tsc-agend as there is a PR open.

Trott added a commit to Trott/io.js that referenced this issue Feb 10, 2022
nodejs-github-bot pushed a commit that referenced this issue Feb 12, 2022
Refs: #40718

PR-URL: #41916
Reviewed-By: Gus Caplan <me@gus.host>
Reviewed-By: Tobias Nießen <tniessen@tnie.de>
Reviewed-By: Beth Griggs <bgriggs@redhat.com>
Reviewed-By: Mohammed Keyvanzadeh <mohammadkeyvanzade94@gmail.com>
Reviewed-By: Antoine du Hamel <duhamelantoine1995@gmail.com>
Reviewed-By: Luigi Pinca <luigipinca@gmail.com>
Reviewed-By: Mestery <mestery@protonmail.com>
Reviewed-By: Derek Lewis <DerekNonGeneric@inf.is>
Reviewed-By: James M Snell <jasnell@gmail.com>
@Mesteery
Copy link
Contributor

Is this issue resolved?

@Trott
Copy link
Member

Trott commented Feb 19, 2022

Is this issue resolved?

I think so, but if anyone has more concrete things that need to be done, leave a comment!

@Trott Trott closed this as completed Feb 19, 2022
bengl pushed a commit to bengl/node that referenced this issue Feb 21, 2022
Refs: nodejs#40718

PR-URL: nodejs#41916
Reviewed-By: Gus Caplan <me@gus.host>
Reviewed-By: Tobias Nießen <tniessen@tnie.de>
Reviewed-By: Beth Griggs <bgriggs@redhat.com>
Reviewed-By: Mohammed Keyvanzadeh <mohammadkeyvanzade94@gmail.com>
Reviewed-By: Antoine du Hamel <duhamelantoine1995@gmail.com>
Reviewed-By: Luigi Pinca <luigipinca@gmail.com>
Reviewed-By: Mestery <mestery@protonmail.com>
Reviewed-By: Derek Lewis <DerekNonGeneric@inf.is>
Reviewed-By: James M Snell <jasnell@gmail.com>
bengl pushed a commit to bengl/node that referenced this issue Feb 21, 2022
Refs: nodejs#40718

PR-URL: nodejs#41916
Reviewed-By: Gus Caplan <me@gus.host>
Reviewed-By: Tobias Nießen <tniessen@tnie.de>
Reviewed-By: Beth Griggs <bgriggs@redhat.com>
Reviewed-By: Mohammed Keyvanzadeh <mohammadkeyvanzade94@gmail.com>
Reviewed-By: Antoine du Hamel <duhamelantoine1995@gmail.com>
Reviewed-By: Luigi Pinca <luigipinca@gmail.com>
Reviewed-By: Mestery <mestery@protonmail.com>
Reviewed-By: Derek Lewis <DerekNonGeneric@inf.is>
Reviewed-By: James M Snell <jasnell@gmail.com>
bengl pushed a commit that referenced this issue Feb 22, 2022
Refs: #40718

PR-URL: #41916
Reviewed-By: Gus Caplan <me@gus.host>
Reviewed-By: Tobias Nießen <tniessen@tnie.de>
Reviewed-By: Beth Griggs <bgriggs@redhat.com>
Reviewed-By: Mohammed Keyvanzadeh <mohammadkeyvanzade94@gmail.com>
Reviewed-By: Antoine du Hamel <duhamelantoine1995@gmail.com>
Reviewed-By: Luigi Pinca <luigipinca@gmail.com>
Reviewed-By: Mestery <mestery@protonmail.com>
Reviewed-By: Derek Lewis <DerekNonGeneric@inf.is>
Reviewed-By: James M Snell <jasnell@gmail.com>
danielleadams pushed a commit to danielleadams/node that referenced this issue Apr 21, 2022
Refs: nodejs#40718

PR-URL: nodejs#41916
Reviewed-By: Gus Caplan <me@gus.host>
Reviewed-By: Tobias Nießen <tniessen@tnie.de>
Reviewed-By: Beth Griggs <bgriggs@redhat.com>
Reviewed-By: Mohammed Keyvanzadeh <mohammadkeyvanzade94@gmail.com>
Reviewed-By: Antoine du Hamel <duhamelantoine1995@gmail.com>
Reviewed-By: Luigi Pinca <luigipinca@gmail.com>
Reviewed-By: Mestery <mestery@protonmail.com>
Reviewed-By: Derek Lewis <DerekNonGeneric@inf.is>
Reviewed-By: James M Snell <jasnell@gmail.com>
danielleadams pushed a commit that referenced this issue Apr 24, 2022
Refs: #40718

PR-URL: #41916
Reviewed-By: Gus Caplan <me@gus.host>
Reviewed-By: Tobias Nießen <tniessen@tnie.de>
Reviewed-By: Beth Griggs <bgriggs@redhat.com>
Reviewed-By: Mohammed Keyvanzadeh <mohammadkeyvanzade94@gmail.com>
Reviewed-By: Antoine du Hamel <duhamelantoine1995@gmail.com>
Reviewed-By: Luigi Pinca <luigipinca@gmail.com>
Reviewed-By: Mestery <mestery@protonmail.com>
Reviewed-By: Derek Lewis <DerekNonGeneric@inf.is>
Reviewed-By: James M Snell <jasnell@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Issues that look for answers. vm Issues and PRs related to the vm subsystem.
Projects
None yet
Development

No branches or pull requests