wasm: run wasmtime on the reactor threads #14046
Conversation
To pull in a fix for a header being typed incorrectly. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
Upcoming commits are going to move wasmtime calls onto the reactor, which means our for_each_record helper needs to be async. We still need the sync version, as the current implementation runs on an alien thread. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
Fork the wasmtime code into another file so that the following patches can incrementally change the implementation in logical pieces for reviewers. Wasmtime's async support requires an entirely separate set of APIs to be used, otherwise there are runtime errors. Forking the implementation and then merging it back in allows the functionality changes to be split up without needing to figure out how to make everything compile and pass tests at each commit. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
The working model here is that wasmtime allocates a separate stack on which to execute Wasm. That stack can be yielded to, which will allow us to run Wasm on the reactor directly. Our host functions also execute on this new stack, so they will only have 64 KiB of stack to work with. We should add some test/debug support for ensuring that our host functions do not use more stack memory than that. If more stack memory is required, one option is to make the host function async, since the continuations then execute on "normal" reactor thread stacks; otherwise we will have to consider a separate memory pool for stacks, as 128 KiB is about the most we want to allocate at once in Redpanda. Also note that these stacks are currently mmap'd, but there is upcoming work in Wasmtime to plug a separate allocator into the runtime; wiring that in will be a followup change once it's ready. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
We're going to use wasmtime's async function support for this, so clean up the host module registration by removing the alien thread usage 👾 Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
Just a refactor to move the code down so the engine is in scope. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
Async host functions are implemented via polling, similar to Rust's Future type. We implement this via a "done" boolean that is threaded through and marked true when the future completes. Wasmtime ensures that all the arguments to the function stay alive until our polling function returns true. There will be a followup that registers the future with the engine so we don't poll the status until the future has completed. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
This moves all code compilation to a single place. The prior implementation compiled all our native code, and then for each instance small trampoline functions had to be compiled to glue the module and its host functions together. This change makes creating instances quicker (no alien threads) and simpler. Additionally, this commit removes the alien thread usage from the engine. It currently uses the blocking APIs, but followup commits will switch to the async Wasmtime APIs. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
When we start an engine we now need to create an instance from the preinitialized module. The reasoning is that we support restarting an engine by stopping and starting it, with the expectation that this clears the instance's memory. The only way to truly reset a store (the holder of memory for an instance) is to delete it and create a new one. We instantiate a preinitialized module using wasmtime's async API, in which any call into the Wasm VM returns a future; each time that future is polled, some work executes on the VM. We have not (yet) configured the yielding behavior, so yielding can only happen when an asynchronous host function is called. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
Use wasmtime's async API to call the _start entry point to main for WASI. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
This uses wasmtime's async APIs to invoke the transform function that we expect to have exported as part of our ABI contract. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
/ci-repeat
Registering the futures of async host functions with the engine allows us to avoid polling the Wasm VM when we already know it's suspended because something (in the schema registry, etc.) is happening on another core. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
Using a 1:1 alien thread pool is wasteful in terms of memory (I believe threads have ~8 MB stacks by default on Linux), so use only a single thread. Possibly in the future we can temporarily spin up more threads if we need to compile many modules (on startup, for instance). We also need to unblock wasmtime's signals on the reactor threads now that we're running Wasm directly on them. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
See the comment in the code, but seastar::thread's use of swapcontext can re-block signals, so we need to unblock them in the tests when they run in debug mode. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
This switches our implementation of wasm::engine from wasmtime's sync API to the async implementation. This commit is essentially `mv wasmtime_async.cc wasmtime.cc`. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
This was only used on an alien thread when wasm was executed in a blocking context, but now we're using the async version so it can be removed. Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
Force push: folded the test fix back into the commit history so that all commits pass tests
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1a1f-ac66-4549-a505-28164f87298c
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1a44-a9eb-4195-b1c4-3b7c702b0e59
src/v/wasm/wasmtime_async.cc (outdated):

    ssx::background
      = std::move(host_future_result)
          .then_wrapped(
            [status, trap_ret, mem = std::move(mem)](ss::future<> fut) {
                if (fut.failed()) {
                    auto msg = ss::format(
                      "Failure executing host function: {}",
                      fut.get_exception());
                    *trap_ret = wasmtime_trap_new(msg.data(), msg.size());
                }
                *status = async_call_done::yes;
            });
    // ...
    continuation->env = status;
    continuation->finalizer = [](void* env) {
        // NOLINTNEXTLINE(*owning-memory)
        delete static_cast<async_call_done*>(env);
    };
    continuation->callback = [](void* env) {
Interesting. So the caller that passes in the continuation will use it to wait for the backgrounded future to complete (or for trap_ret to be set)?
Yes, the actual implementation is here:
https://github.com/bytecodealliance/wasmtime/pull/7106/files#diff-609e139b86cac725103dd2d38ce4d0018e0eb41a56278255217354e523b784bdR117-R136
The `.await` is a fancy way of waiting until true is returned here, and stuff on the stack is kept alive, similar to a coroutine in C++; see this: https://github.com/bytecodealliance/wasmtime/pull/7106/files#diff-609e139b86cac725103dd2d38ce4d0018e0eb41a56278255217354e523b784bdR70-R81
But maybe my answer to your other question sheds light on what's happening under the covers here. It's really cool stuff!
Got it, thanks. It is very cool.
`continuation.await;`
Does this correspond to some thread managed by wasmtime?
Never mind, I think it is answered in the next comment on this PR related to poll.
src/v/wasm/wasmtime_async.cc (outdated):

    while (!wasmtime_call_future_poll(fut.get())) {
        co_await ss::coroutine::maybe_yield();
what's on the other side of this, like work on another stack or something? is this call driving execution of whatever is on the other side?
From https://docs.wasmtime.dev/api/wasmtime/struct.Config.html#asynchronous-wasm

> The poll method of the futures returned by Wasmtime will perform the actual work of calling the WebAssembly. Wasmtime won't manage its own thread pools or similar, that's left up to the embedder.
>
> To implement futures in a way that WebAssembly sees asynchronous host functions as synchronous, all async Wasmtime futures will execute on a separately allocated native stack from the thread otherwise executing Wasmtime. This separate native stack can then be switched to and from. Using this whenever an async host function returns a future that resolves to Pending we switch away from the temporary stack back to the main stack and propagate the Pending status.

Basically, calling poll here switches to the "Wasm" stack and runs the VM. Async host functions end up being propagated to `poll`, and the continuation we return in async host functions is the bit that checks whether a `poll` is able to go back to executing Wasm within the VM. This is why we need to pass back the future from the continuation: otherwise we'd busy-loop asking if the future is done, instead of letting seastar reschedule this fiber when the future completes (at that point the continuation returns true and we go back to running the VM).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
awesome
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1c51-81d4-4849-a344-7078913348e2
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1c51-6e26-49d1-8742-e72317e82ef6
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1c51-6686-4247-9a4a-ed555914b132
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1cba-1097-4043-b6a3-6bfa0df38b70
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1cba-38b5-412a-9bc1-95fc91f3c06c
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1d66-f64a-49d0-8b27-7d7c3886b2f8
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1d67-0760-48fd-940a-14ff30113e4a
ducktape was retried in job https://buildkite.com/redpanda/redpanda/builds/38630#018b1e8e-72ca-4fd7-ad69-c2c73c7f4d32
This patch set moves our usage of Wasmtime over to the async API for invoking the VM.
The async API is well documented here. In short, Wasm is executed on a separate stack
(as are our host functions), and the VM can swap back to the caller's stack to "yield"
execution back to the caller. This will allow us to yield control back to the scheduler
so we don't go over our time budget. That yielding behavior is not yet implemented; this
patch set only sets us up to do it, and the rest is a matter of configuring and tuning
the amount of fuel we give the VM.
Moving onto the reactor has greatly improved the throughput of transforms in my benchmarking
(3x more!), and will allow us to change our ABI so that we can use upstream golang in addition
to tinygo. More details on the new ABI will come in another patch set.
Some nuts + bolts/reviewer notes: the async API is essentially an entirely separate set
of APIs. To make sure tests pass in every commit, I temporarily fork the implementation
into another file that isn't built, then merge it back in. So individual commits may not
be fully self-contained, but the changes are broken up to hopefully aid the review process.
The set of APIs used here are very new to wasmtime, and if you're curious
about under the hood how some of these work, these PRs might be helpful
to review:
bytecodealliance/wasmtime#7140
bytecodealliance/wasmtime#7106
Backports Required
Release Notes