Use seccomp instead of setsid() to workaround CVE-2017-5226 #150

alexlarsson · 2017-01-16T09:51:07Z

The setsid() workaround of
#143 is problematic,
because it e.g. breaks shell job control for bubblewrap instances.
So, instead we use a seccomp approach based on:
util-linux/util-linux@8e49250
However, since we don't want to pull in any more dependencies into
the setuid binary we pre-compile the seccomp code during the build.

If libseccomp is not available on your architecture, we still support
the old fix with --disable-seccomp-tty-fix.

This fixes #147

The setsid() workaround of containers#143 is problematic, because it e.g. breaks shell job control for bubblewrap instances. So, instead we use a seccomp approach based on: util-linux/util-linux@8e49250 However, since we don't want to pull in any more dependencies into the setuid binary we pre-compile the seccomp code during the build. If libseccomp is not available on your architecture, we still support the old fix with --disable-seccomp-tty-fix. This fixes containers#147

alexlarsson · 2017-01-16T09:52:30Z

@smcv I tested this with your test.c from #142 and it seems to work.

cgwalters · 2017-01-16T12:58:18Z

Hmm. But this still leaves bwrap users on other arches with that behavior, which we'd still have to then document how to work around in examples, etc.

Note that util-linux reverted their seccomp change; the commit message doesn't document extensively why though.

cgwalters · 2017-01-16T13:00:48Z

I'm not opposed to this though if you think it's worth it (which I guess you do having written the patch 😄 )

alexlarsson · 2017-01-16T13:10:18Z

Its definately worth it, I've updated flatpak to the bubblewrap 0.1.6 in master, and it is completely useless on the command line. Whatever you do you end up with multiple processes reading from the terminal and you have to start a new terminal to find which one to kill. Its a complete show-stopper imho,

alexlarsson · 2017-01-16T13:11:55Z

Are there any arches that don't have seccomp yet though? If so its probably just a matter of time before they get it.

Also, I don't see how you can work around any issues with this...

cgwalters · 2017-01-16T14:56:34Z

You can work around it by creating a new pty and having the shell use that as the controlling terminal. I don't have a handy one liner for this, but e.g. running tmux seems to work just fine.

alexlarsson · 2017-01-16T15:06:14Z

You can work around the sh: cannot set terminal process group (-1): Inappropriate ioctl for device" error, but you can't work around the fact that if you e.g. run bwrap --bind / / sh, then ctrl-C then you get two shell reading the input from the same terminal with intermingled output. Trying to use flatpak for development and debugging just one day with this makes it obvious that it won't work at all.

alexlarsson · 2017-01-16T15:08:04Z

For instance, you can't do flatpak run org.my.App and expect ctrl-C to kill it.

cgwalters · 2017-01-16T17:33:17Z

We could address the Ctrl-C UX by having a process outside of the container that acts as a lifecycle bind to init inside the container. Basically both watch a pipe, and if one exits, the other does.

alexlarsson · 2017-01-16T17:35:17Z

That's just a single example though. The general issue is that it doesn't behave like a UNIX command.

alexlarsson · 2017-01-16T17:36:50Z

I.E it doesn't get sigstop when it writer to the try when backgrounded, etc

cgwalters · 2017-01-16T17:37:52Z

configure.ac

+
+if test "x$enable_seccomp_tty_fix" = "xno"; then
+   AC_DEFINE([DISABLE_SECCOMP], [1],
+      [Define if using seccomp])


if not using

cgwalters · 2017-01-16T17:43:06Z

Makefile-bwrap.am

+generate_seccomp_LDADD = $(LIBSECCOMP_LIBS) $(SELINUX_LIBS)
+
+seccomp-filter.h: generate-seccomp$(EXEEXT)
+	./generate-seccomp$(EXEEXT) > seccomp-filter.h


Does this work in a cross-compilation scenario? Offhand it seems we'd generate rules for the wrong arch?

Well, it will fail to cross-compile because the built generate-seccomp will fail to run.

cgwalters · 2017-01-16T17:44:34Z

What UNIX/commandline issues wouldn't be addressed by having an "outer init"? (I realized there's no need for a pipe, the "outer init" could just have the "container init" as a child process, and the "outer init" watches it via SIGCHILD, and container init should use PR_SET_PDEATHSIG)

cgwalters · 2017-01-16T17:50:23Z

Eh, SIGSTOP. Yeah, true. We'd have to go to proxying read/write and signals across. I guess really we'd end up with a userspace pty emulator, just without TIOCSTI.

alexlarsson · 2017-01-16T17:50:58Z

We could have an outer init that proxies stdin/stdout/stderror and signals (SIGSTOP/SIGCONT, etc), but it would never quite get the right thing, because e.g. /dev/tty will not work properly inside it. It also seems very complex and easy to get wrong.

cgwalters · 2017-01-16T17:55:59Z

I think for /dev/tty we could argue that bwrap-using software like flatpak should implement a shell command which would inject code into the container to allocate a pty and run a shell with it.

cgwalters · 2017-01-16T17:58:02Z

The scope for "outer init" would be a lot smaller if we specified that we didn't support ttys (e.g. TIOCGWINSZ); basically if you couldn't run vi/emacs/tmux/etc inside.

alexlarsson · 2017-01-16T17:59:15Z

we still have to proxy things like canonical/raw mode, etc, no?

cgwalters · 2017-01-16T17:59:53Z

Although, "outer init + setsid()" still seems viable for the basic "ctrl-c'able" case. Not sure how many people would initially notice the SIGSTOP/backgroundable bit.

cgwalters · 2017-01-16T18:01:22Z

OK so...why explicitly generate the rules at build time? (And do you agree it breaks cross compilation?) Is there other precedent for this? systemd for example doesn't seem to do this.

alexlarsson · 2017-01-16T18:02:06Z

Because I don't want to load essentially a compiler into a setuid binary.

smcv · 2017-01-16T18:05:27Z

we could argue that bwrap-using software like flatpak should implement a shell command which would inject code into the container to allocate a pty and run a shell with it

Here is that code: xterm :-P

(or an xterm clone with lighter-weight dependencies)

cgwalters · 2017-01-16T18:16:18Z

It is a compiler of sorts, but it's operating on known, static trusted input. I'm not sure it's worth breaking cross builds for this, though I admit to not actively maintaining a cross-built system right now myself.

alexlarsson · 2017-01-16T18:36:03Z

Last fixup drops the generate at build-time

cgwalters · 2017-01-16T19:12:29Z

One other concern that pops to mind - we're still vulnerable without the ptrace-after-seccomp patches, which are in 4.8, but probably not backported to kernels like the CentOS7/RHEL one.

alexlarsson · 2017-01-16T19:42:03Z

For flatpak we disable ptrace by default unless you run with -d or grant "developer" permissions.

alexlarsson · 2017-01-16T19:43:01Z

This is also disabled with ptrace, but i believe that by itself should be ok, because the ptrace-after-seccomp issue is after ptrace has been enabled, no?

cgwalters · 2017-01-16T19:51:05Z

So...what one could argue here is we should really have a --disable-setsid runtime option. Then, this seccomp filter could live in flatpak, where it also knows that it has a filter to disable ptrace.

Conceptually, the CVE then isn't in bwrap - it's in any program which is using bwrap with a pty connected to separate security domains. There are other ways to fix the issue externally - for example, just don't provide a pty to the child process - if it's a background daemon, connect it to the e.g. systemd journal. Or, per above, install a seccomp filter in your software.

One argument that the setsid() invocation shouldn't have been added to bwrap at all is the fact that we support not passing --unshare-pid. If one does that it's obviously trivial to kill processes outside.

Another really important example is we support providing /home into the container, which may have ssh/etc key material...

So there's all of this "best practice" stuff that needs to live outside. Hence then, why don't we add a command line option --disable-setsid to allow programs to opt-in to disabling it?

alexlarsson · 2017-01-16T19:56:52Z

That sounds good to me, although --disable-setsid is perhaps inverted, something like --new-session might be better (or you could just run setsid bwrap ...

cgwalters · 2017-01-16T20:00:07Z

The reason I phrased it as --disable-setsid is theoretically (very very theoretically...) there is some software out there relying on the fact we added setsid() in 0.1.6.

That said...I think I'm convincing myself we should do this:

Revert the addition of setsid() (or as you suggest make it an option - why not, it's convenient)
Change demos/bubblewrap-shell.sh to do it along with other hardening (unsharing all of the namespaces, explicitly not inheriting /home, perhaps too scrubbing the environment? (one often has auth tokens there))
Change flatpak to add the seccomp filter, and point users at that as a best practice if you want to retain a pty

alexlarsson · 2017-01-16T20:01:35Z

By 1) do you mean invert its meaning? Or rely on apps to use setsid(1)?

smcv · 2017-01-16T20:05:39Z

I'd prefer --disable-setsid (or maybe turn its meaning around and call it --keep-terminal) rather than --enable-setsid, so that the simplest possible use is the most-sandboxed, and you punch holes; and, in particular, so that people using bwrap for non-Flatpak things that we don't know about are protected against contained processes typing into their terminals.

That's consistent with how bwrap does filesystems (if you don't do any --bind then you don't get to see those files), although admittedly not consistent with the --unshare-foo family of options.

alexlarsson · 2017-01-16T20:08:04Z

Have you tried to use bwarp with setsid though? Its extremely easy to get into very confusing situations where the terminal is essentially unusable.

In discussion in containers#150 it was noted that most of the bwrap command line tends towards "closed by default, request open". But the `--unshare` options are inverse. Now, I suspect in practice there's only one namespace that most users will care about, which is the network namespace. There are very useful programs to build on both cases. I think everything else (pid, ipc, uts) people will want as a group. Any cases that are unusual enough to want to turn one of them off can still fall back to the previous bwrap behavior of explicitly unsharing. They're likely to be security sensitive enough that if a new namespace were added, it would make sense to evaluate the tool. But again I think most users will want all namespaces, with the network one as a primary "enable it" option.

alexlarsson · 2017-01-17T09:17:39Z

So, the reason I dislike --disable-setsid is, if we ignore for the moment the CVE, that we're introducing a new default that changes the semantics of the sandbox. Suddenly we're making something that worked (such as flatpak) and essentially break it if you update bubblewrap (because, with bubblewrap 0.1.6 doing development with flatpak is basically broken).

I think most users of bubblewrap want as-secure-as-possible, but don't break my app. However, this is really really hard to guarantee for random apps, so the only guaranteed way is to add sandboxing features as default open, ask for limit.

For example, I can add a --disable-setsid to bubblewrap, and then use that from flatpak. However, this means the next flatpak has to require the newer bubblewrap (will fail with the old one with unknown switch). My plan was to do a stable bugfix-only release of flatpak and hope stable distros (like Debian 9) could just pick it up. However, thay may be foiled by having to rely on a new bubblewrap that adds new features (the switch).

alexlarsson · 2017-01-17T11:43:43Z

@cgwalters #154 has an approach like you suggested above instead. Then I'll add the seccomp rule to flatpak instead.

rh-atomic-bot · 2017-01-17T13:46:00Z

☔ The latest upstream changes (presumably a6e1516) made this pull request unmergeable. Please resolve the merge conflicts.

In discussion in containers#150 it was noted that most of the bwrap command line tends towards "closed by default, request open". But the `--unshare` options are inverse. Now, I suspect in practice there's only one namespace that most users will care about, which is the network namespace. There are very useful programs to build on both cases. I think everything else (pid, ipc, uts) people will want as a group. Any cases that are unusual enough to want to turn one of them off can still fall back to the previous bwrap behavior of explicitly unsharing. They're likely to be security sensitive enough that if a new namespace were added, it would make sense to evaluate the tool. But again I think most users will want all namespaces, with the network one as a primary "enable it" option.

In discussion in #150 it was noted that most of the bwrap command line tends towards "closed by default, request open". But the `--unshare` options are inverse. Now, I suspect in practice there's only one namespace that most users will care about, which is the network namespace. There are very useful programs to build on both cases. I think everything else (pid, ipc, uts) people will want as a group. Any cases that are unusual enough to want to turn one of them off can still fall back to the previous bwrap behavior of explicitly unsharing. They're likely to be security sensitive enough that if a new namespace were added, it would make sense to evaluate the tool. But again I think most users will want all namespaces, with the network one as a primary "enable it" option. Closes: #153 Approved by: alexlarsson

cgwalters · 2017-01-17T17:52:06Z

It's still an open question a bit to me whether we want to add any seccomp to bwrap itself.

hartwork · 2023-03-14T22:10:02Z

If this gets picked up later, please note that it's not just TIOCSTI but also TIOCLINUX, see https://github.com/jwilk/ttyjack for proof and details. For anyone who also likes to play with a related seccomp filter and e.g. pass it to bubblewrap via --seccomp without need to create a BPF program by hand, please see https://github.com/hartwork/antijack .

hartwork · 2023-03-16T00:10:05Z

bubblewrap.c

+      }
+
+    if (seccomp_rule_add (ctx, SCMP_ACT_ERRNO(EPERM), SCMP_SYS(ioctl), 1,
+                          SCMP_A1(SCMP_CMP_EQ, (int)TIOCSTI)) < 0)


This would need SCMP_CMP_MASKED_EQ rather than SCMP_CMP_EQ to not re-introduce CVE-2019-10063. Sending TIOCSTI + 0x100000000 (eight zeros) can be used for a test.

fixup! Use seccomp instead of setsid() to workaround CVE-2017-5226

ba31118

cgwalters reviewed Jan 16, 2017

View reviewed changes

fixup! Use seccomp instead of setsid() to workaround CVE-2017-5226

7777190

fixup! Use seccomp instead of setsid() to workaround CVE-2017-5226

c705959

cgwalters mentioned this pull request Jan 16, 2017

[merged] Add --unshare-all and --share-net #153

Closed

alexlarsson mentioned this pull request Jan 17, 2017

[merged] Make the call to setsid() optional, with --new-session #154

Closed

hartwork mentioned this pull request Feb 28, 2023

"--new-session" underadvertised and CVE-2017-5226 still a thing in 2023 by default? #555

Open

hartwork reviewed Mar 16, 2023

View reviewed changes

Use seccomp instead of setsid() to workaround CVE-2017-5226 #150

Are you sure you want to change the base?

Use seccomp instead of setsid() to workaround CVE-2017-5226 #150

Conversation

alexlarsson commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017

cgwalters commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters Jan 16, 2017

Choose a reason for hiding this comment

cgwalters Jan 16, 2017

Choose a reason for hiding this comment

alexlarsson Jan 16, 2017

Choose a reason for hiding this comment

cgwalters commented Jan 16, 2017

cgwalters commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017

cgwalters commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017

cgwalters commented Jan 16, 2017 • edited Loading

alexlarsson commented Jan 16, 2017

smcv commented Jan 16, 2017

cgwalters commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017 • edited Loading

alexlarsson commented Jan 16, 2017

cgwalters commented Jan 16, 2017 • edited Loading

alexlarsson commented Jan 16, 2017

smcv commented Jan 16, 2017

alexlarsson commented Jan 16, 2017

alexlarsson commented Jan 17, 2017

alexlarsson commented Jan 17, 2017

rh-atomic-bot commented Jan 17, 2017

cgwalters commented Jan 17, 2017

hartwork commented Mar 14, 2023

hartwork Mar 16, 2023

Choose a reason for hiding this comment

cgwalters commented Jan 16, 2017 •

edited

Loading

cgwalters commented Jan 16, 2017 •

edited

Loading

cgwalters commented Jan 16, 2017 •

edited

Loading