Add a "bounded runtime" mode to guest execution. #612

cfallin · 2020-11-21T01:29:13Z

This modifies the instruction-counting mechanism to track an
instruction-count bound as well; and when a bound is set, the Wasm guest
will make a hostcall to yield back to the host context.

This is all placed under the run_async() function, so bounded-execution
yields manifest as futures that are immediately ready to continue execution.
This should give an executor main loop a chance to do other work at regular
intervals.

lucet-runtime/src/c_api.rs

lucet-runtime/tests/instruction_counting.rs

lucet-runtime/lucet-runtime-internals/src/future.rs

cfallin · 2020-11-30T19:35:24Z

Just made a few tweaks to (hopefully) get a green CI again; recent changes in wasmtime mean we'll need to fix a few things in Lucet before merging this PR.

The GuestMemory API changed; @pchickey I think the impl of this trait in Lucet needs to be updated as well?
There were some build errors related to hostcall types; @fst-crenshaw this is I think related to your work in Fixes #2418: Enhance wiggle to generate its UserErrorConverstion trait with a function that returns Result<abi_err, String> wasmtime#2419 -- will there be a Lucet update as part of that work too?

I'm happy to do either or both of these if needed, just let me know and I'll dig in further to understand the changes better :-) Thanks!

iximeow · 2020-11-30T19:44:39Z

@cfallin i've cooked up a patch for point 1, and am figuring out point 2 at the moment - i expect we can piece those changes out with a wasmtime update independent of this PR landing, so we don't have to worry about tying up too much in one PR

lucet-wiggle/src/lib.rs

cfallin · 2020-11-30T23:54:13Z

Rebased on top of #614; this should be ready for review now (thanks @iximeow for the quick work on that PR!).

acfoltzer

Thanks so much for getting this implemented! It's going to make a huge difference for async Lucet embedders.

I have a handful of suggestions here but nothing fundamental.

lucet-runtime/lucet-runtime-internals/src/future.rs

lucet-runtime/lucet-runtime-internals/src/instance.rs

lucet-runtime/lucet-runtime-internals/src/future.rs

lucetc/src/compiler.rs

lucetc/src/function.rs

acfoltzer · 2021-01-09T01:36:54Z

Ah, it occurred to me that we also need a version of Instance::run_start() that has the same bounded time properties. @cfallin would you be able to add this before we merge?

This modifies the instruction-counting mechanism to track an instruction-count bound as well; and when a bound is set, the Wasm guest will make a hostcall to yield back to the host context. This is all placed under the `run_async()` function, so bounded-execution yields manifest as futures that are immediately ready to continue execution. This should give an executor main loop a chance to do other work at regular intervals.

cfallin · 2021-01-11T18:56:17Z

Updated, and added the async Instance::run_async_start() as well.

This PR revises the simple instruction-count binding work, which initially loaded and stored the instruction-count field from memory on every basic block and emitted an extra conditional branch and yielding block after every basic block, with two optimizations: 1. The instruction count is kept in a local variable (which the register allocator will ideally keep in a register unless spills are necessary), and this count is saved back to the field in memory only when necessary, e.g., before calls and returns. 2. The checks and conditional branches are inserted only in places where necessary to ensure bounded time between checks, at the cost of slightly more overrun/approximation in the bound: specifically, before calls and returns (to avoid unbounded runtime due to recursion) and at loop backedges.

acfoltzer

Thank you for the fast work on this!

benaubin · 2021-01-14T23:16:58Z

Curious - would it be possible to expose these APIs publicly?

cfallin · 2021-01-14T23:30:38Z

@benaubin run_async() is a public method on InstanceHandle already -- or did you mean some other part of the API?

benaubin · 2021-01-14T23:39:28Z

@cfallin I'd love to have access to the InternalRunResult without using run_async in order to set an approximate CPU usage limit (a limit higher than reasonably expected, mostly to stop a while true {}). We plan to bill on CPU time, and can't do so with the run_async API.

cfallin · 2021-01-14T23:53:29Z

@benaubin I see -- the InternalRunResult is an implementation detail (it's how the yield loop in run_async() is made aware of time-bound yields), so I don't think it makes sense to expose that type as-is.

If you are not also using the value-yield functionality in hostcalls, you can implement your use-case with run_async() as-is, I think: manually inspect the returned future and consider the result to be a timeout if it is not ready after the first invocation.

Otherwise, the right thing to do would be to have a separate run_with_timeout() API, I think: this is semantically different than the existing run methods, all of which run to completion (possibly with yields) and so its type needs to be different.

If the latter is what you need, I'm happy to review a PR! Unfortunately I don't have time to work on this myself at the moment though.

benaubin · 2021-01-14T23:58:49Z

Happy to do a PR. Renaming InternalRunResult to BoundedRunResultand exposing it would provide a perfect run_with_timeout API as-is.

I'd be happy to build a timeout around CPU time as well, but it would require adding calls to libc for measuring the thread's cpu time - bloat that might not be applicable to all users.

benaubin · 2021-01-15T00:03:35Z

Oh, I see what you mean! There would need to be a run method that returns BoundedRunResult. I'll add a run_bounded method.

cfallin · 2021-01-15T00:37:06Z

@benaubin One other thing that I should probably note also, is that longer-term, the plan is to merge Lucet's functionality into Wasmtime (see blog post from Bytecode Alliance and another blog post on the merging efforts). There are active efforts to make this happen, e.g. async support in bytecodealliance/rfcs#2 and fast instance allocation in bytecodealliance/rfcs#5. In the meantime Lucet is absolutely supported, and as mentioned I'm happy to review a PR; but this may be relevant for your planning :-)

benaubin · 2021-01-15T00:40:20Z

Thanks for the heads up! We saw that, but want lucet's Send instances. Our host call surface is purposefully minimal (instances mostly communicate with an external API server) - which should make switching less painful (and sandboxing slightly easier).

benaubin · 2021-01-15T00:49:28Z

@cfallin I've put an initial draft PR together (#625). Adds a few _bounded methods, and replaces InternalRunResult with an additional Error variant.

benaubin · 2021-01-15T22:22:05Z

@cfallin On second thought, you were probably right that the InternalRunResult should remain a private implementation detail. I refactored the run_async_impl into a custom Future implementation, which gives the same benefit as exposing the bounded runtime apis, without complicating the API surface. Could you review PR #626?

iximeow reviewed Nov 21, 2020

View reviewed changes

lucet-runtime/src/c_api.rs Outdated Show resolved Hide resolved

iximeow reviewed Nov 21, 2020

View reviewed changes

lucet-runtime/tests/instruction_counting.rs Show resolved Hide resolved

alexcrichton reviewed Nov 23, 2020

View reviewed changes

lucet-runtime/lucet-runtime-internals/src/future.rs Outdated Show resolved Hide resolved

cfallin force-pushed the cfallin/bounded-execution branch from d5adc00 to 86ca1d1 Compare November 23, 2020 18:43

cfallin changed the title ~~WIP: add a "bounded runtime" mode to guest execution.~~ Add a "bounded runtime" mode to guest execution. Nov 23, 2020

alexcrichton reviewed Nov 23, 2020

View reviewed changes

lucet-runtime/lucet-runtime-internals/src/future.rs Outdated Show resolved Hide resolved

cfallin force-pushed the cfallin/bounded-execution branch 5 times, most recently from 58a5c8c to cc3ce00 Compare November 25, 2020 07:29

cfallin mentioned this pull request Nov 25, 2020

Fix Wasm translator bug: end of toplevel frame is branched-to only for fallthrough returns. bytecodealliance/wasmtime#2450

Merged

acfoltzer self-requested a review November 30, 2020 18:22

pchickey reviewed Nov 30, 2020

View reviewed changes

lucet-wiggle/src/lib.rs Outdated Show resolved Hide resolved

cfallin force-pushed the cfallin/bounded-execution branch 2 times, most recently from e4e95dc to 4e25235 Compare November 30, 2020 23:53

acfoltzer suggested changes Jan 9, 2021

View reviewed changes

cfallin force-pushed the cfallin/bounded-execution branch from 4e25235 to 276d018 Compare January 11, 2021 18:54

cfallin force-pushed the cfallin/bounded-execution branch from 276d018 to ea3c239 Compare January 11, 2021 18:57

acfoltzer approved these changes Jan 11, 2021

View reviewed changes

cfallin merged commit 3d8df65 into main Jan 11, 2021

cfallin deleted the cfallin/bounded-execution branch January 11, 2021 19:47

benaubin mentioned this pull request Jan 15, 2021

Expose bounded runtime #625

Closed

benaubin mentioned this pull request Jan 15, 2021

Refactor run_async to use custom Future implementation #626

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a "bounded runtime" mode to guest execution. #612

Add a "bounded runtime" mode to guest execution. #612

cfallin commented Nov 21, 2020 •

edited

Loading

cfallin commented Nov 30, 2020

iximeow commented Nov 30, 2020

cfallin commented Nov 30, 2020 •

edited

Loading

acfoltzer left a comment

acfoltzer commented Jan 9, 2021

cfallin commented Jan 11, 2021

acfoltzer left a comment •

edited

Loading

benaubin commented Jan 14, 2021

cfallin commented Jan 14, 2021

benaubin commented Jan 14, 2021 •

edited

Loading

cfallin commented Jan 14, 2021

benaubin commented Jan 14, 2021 •

edited

Loading

benaubin commented Jan 15, 2021

cfallin commented Jan 15, 2021

benaubin commented Jan 15, 2021 •

edited

Loading

benaubin commented Jan 15, 2021

benaubin commented Jan 15, 2021 •

edited

Loading

Add a "bounded runtime" mode to guest execution. #612

Add a "bounded runtime" mode to guest execution. #612

Conversation

cfallin commented Nov 21, 2020 • edited Loading

cfallin commented Nov 30, 2020

iximeow commented Nov 30, 2020

cfallin commented Nov 30, 2020 • edited Loading

acfoltzer left a comment

Choose a reason for hiding this comment

acfoltzer commented Jan 9, 2021

cfallin commented Jan 11, 2021

acfoltzer left a comment • edited Loading

Choose a reason for hiding this comment

benaubin commented Jan 14, 2021

cfallin commented Jan 14, 2021

benaubin commented Jan 14, 2021 • edited Loading

cfallin commented Jan 14, 2021

benaubin commented Jan 14, 2021 • edited Loading

benaubin commented Jan 15, 2021

cfallin commented Jan 15, 2021

benaubin commented Jan 15, 2021 • edited Loading

benaubin commented Jan 15, 2021

benaubin commented Jan 15, 2021 • edited Loading

cfallin commented Nov 21, 2020 •

edited

Loading

cfallin commented Nov 30, 2020 •

edited

Loading

acfoltzer left a comment •

edited

Loading

benaubin commented Jan 14, 2021 •

edited

Loading

benaubin commented Jan 14, 2021 •

edited

Loading

benaubin commented Jan 15, 2021 •

edited

Loading

benaubin commented Jan 15, 2021 •

edited

Loading