wasi-nn: add named models #6854

abrown · 2023-08-17T00:50:44Z

This implements named models in Wasmtime; see the commit messages for more details.

abrown · 2023-08-17T00:56:28Z

ci/run-wasi-nn-example.sh

+cp target/wasm32-wasi/release/wasi-nn-example-named.wasm $TMP_DIR
+popd
+cargo run -- run --mapdir fixture::$TMP_DIR --graph openvino::$TMP_DIR \
+    --wasi-modules=experimental-wasi-nn $TMP_DIR/wasi-nn-example-named.wasm


I'm not quite happy with using this script for testing. It was good initially to get things working, but I would prefer to use cargo test instead. The problem is a dependency expectation: the script can only run if the system has OpenVINO installed, which will not always be the case for developers running cargo test.

I can think of two workarounds:

* #[ignore] the OpenVINO-specific tests and add some comments telling developers to turn them on to fully test the system

* add some checks at the beginning of each test to early-exit if OpenVINO is not available; the issue with this is that users may see "false success" if something goes wrong in the environment, so perhaps a FORCE_RUN_OPENVINO_TESTS env variable is necessary?

Interested in some feedback on whether one of those approaches is better than this script!
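To make the second workaround concrete, here is a minimal sketch of the early-exit decision. Only the FORCE_RUN_OPENVINO_TESTS variable name comes from the comment above; the `Decision` enum, `decide`, and `force_requested` are hypothetical names, and how OpenVINO availability is actually detected is left as an input because the thread does not specify it.

```rust
use std::env;

/// Hypothetical outcome of the pre-test check (not part of Wasmtime).
#[derive(Debug, PartialEq)]
enum Decision {
    /// OpenVINO was found: run the test normally.
    Run,
    /// OpenVINO is missing and nobody insisted: return early from the test.
    Skip,
    /// OpenVINO is missing but the tests were forced: fail loudly so that
    /// a broken environment does not show up as a "false success".
    Fail,
}

/// Pure decision logic; `openvino_available` is supplied by the caller
/// because the detection mechanism is an open question in this thread.
fn decide(openvino_available: bool, force: bool) -> Decision {
    match (openvino_available, force) {
        (true, _) => Decision::Run,
        (false, false) => Decision::Skip,
        (false, true) => Decision::Fail,
    }
}

/// Assumption: the variable counts as "set" whenever it is present,
/// regardless of its value.
#[allow(dead_code)]
fn force_requested() -> bool {
    env::var_os("FORCE_RUN_OPENVINO_TESTS").is_some()
}
```

A test would call `decide(available, force_requested())` first, return early on `Skip`, and panic on `Fail`; CI could then set the variable so that a missing OpenVINO install is never silently skipped.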

Personally I like your second suggestion, and it matches what I was going to suggest as well. We do run a higher risk of not actually running the tests in CI, but I think running something by default as part of cargo test --workspace is a good mitigating factor for that.

If the tests are for wasi-nn, we should just have a test backend that does something basic like an affine layer.

If this is testing integration we should think about how we handle multiple backends. This same issue will come up with pytorch, onnxruntime, tf, etc.
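For reference, the "affine layer" mentioned above is just y = W·x + b. A rough sketch of the computation such a test backend might perform is below; the function name and the row-major weight layout are assumptions for illustration, and nothing here is wired into the wasi-nn backend traits.

```rust
/// Compute y = W·x + b, where `weights` is a row-major matrix with
/// `bias.len()` rows and `x.len()` columns. (Hypothetical helper.)
fn affine(weights: &[f32], bias: &[f32], x: &[f32]) -> Vec<f32> {
    // Reject mismatched shapes up front instead of slicing out of bounds.
    assert_eq!(weights.len(), bias.len() * x.len());
    bias.iter()
        .enumerate()
        .map(|(row, b)| {
            // Slice out one row of W and take its dot product with x.
            let w_row = &weights[row * x.len()..(row + 1) * x.len()];
            w_row.iter().zip(x).map(|(wi, xi)| wi * xi).sum::<f32>() + b
        })
        .collect()
}
```

A backend built on something this small has no external dependency, so it could run under a plain cargo test on any machine.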

@alexcrichton, I have a whole other set of commits ready for switching to that kind of testing. Just waiting on a review of this one because the new testing also covers the load_by_name logic added here.

Ok, check out #6895 if you're interested in where I went with this.

abrown · 2023-08-17T16:36:24Z

crates/wasi-nn/src/registry/mod.rs

+pub use in_memory::InMemoryRegistry;
+
+pub trait GraphRegistry: Send + Sync {
+    fn get_mut(&mut self, name: &str) -> Option<&mut Graph>;


Another thing I'm not quite sure of is whether this should return Option<&mut Graph>, Option<&Graph>, or even Option<Graph> (seeing as how Graph is just an Arc-wrapper). We immediately clone the graph and I removed the mutability requirement for BackendGraph::init_execution_context... @geekbeast, any opinions from implementing other kinds of GraphRegistry?
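To illustrate the `Option<&Graph>` alternative, here is a simplified sketch. The `Graph` below is a stand-in that wraps an `Arc<String>` rather than the real backend graph, and `get` and `load_by_name` are illustrative signatures, not the ones in this PR. The point is only that, when the caller clones immediately and the clone is an `Arc` bump, a shared reference is sufficient.

```rust
use std::collections::HashMap;
use std::sync::Arc;

/// Stand-in for wasi-nn's `Graph`; the payload type is a placeholder.
#[derive(Clone)]
struct Graph(Arc<String>);

/// Variant of the trait that hands out a shared reference.
trait GraphRegistry: Send + Sync {
    fn get(&self, name: &str) -> Option<&Graph>;
}

/// Simplified in-memory registry keyed by model name.
struct InMemoryRegistry(HashMap<String, Graph>);

impl GraphRegistry for InMemoryRegistry {
    fn get(&self, name: &str) -> Option<&Graph> {
        self.0.get(name)
    }
}

/// The caller clones the graph out of the registry; because `Graph`
/// wraps an `Arc`, this only increments a reference count.
fn load_by_name(registry: &dyn GraphRegistry, name: &str) -> Option<Graph> {
    registry.get(name).cloned()
}
```

Returning `Option<Graph>` directly would move the clone into the registry, which may suit registries that build or fetch graphs lazily; that trade-off is the open question above.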

alexcrichton

I can't really review the wasi-nn parts specifically, but it all seems reasonable. The src/commands/run.rs bits look good to me, although I might recommend renaming --graph to something like --wasi-nn-graph or --nn-graph to make it a bit clearer that it's for wasi-nn as opposed to something else graph-related (not that we have much else like that right now).

* wasi-nn: add [named models]

  This change adds a way to retrieve preloaded ML models (i.e., "graphs" in wasi-nn terms) from a registry. The wasi-nn specification includes a new function, `load_by_name`, that can be used to access these models more efficiently than before; previously, a user's only option was to read/download/etc. all of the bytes of an ML model and pass them to the `load` function.

  [named models]: WebAssembly/wasi-nn#36

  In Wasmtime's implementation of wasi-nn, we call the registry that holds the models a `GraphRegistry`. We include a simplistic `InMemoryRegistry` for use in the Wasmtime CLI (more on this later) but the idea is that production use will involve some more complex caching and thus a new implementation of a registry--a `Box<dyn GraphRegistry>`--passed into the wasi-nn context. Note that, because we now must be able to `clone` a graph out of the registry and into the "used graphs" table, the OpenVINO `BackendGraph` is updated to be easier to copy around.

  To allow experimentation with this "preload a named model" functionality, this change also adds a new Wasmtime CLI flag: `--graph <encoding>:<host dir>`. Wasmtime CLI users can now preload a model from a directory; the directory `basename` is used as the model name. Loading models from a directory is probably not desired in Wasmtime embeddings so it is cordoned off into a separate `BackendFromDir` extension trait.

* wasi-nn: add "named model" test

  Add a new example crate which loads a model by name and performs image classification. It uses the same MobileNet model as the existing test but a new version of the Rust bindings. The new crate is built and run with the new CLI flag in the `ci/run-wasi-nn-example.sh` script.

  prtest:full

* review: rename `--graph` to `--wasi-nn-graph`
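As a small illustration of "the directory `basename` is used as the model name", the sketch below derives a name from a host path using only the standard library. The helper name `model_name_from_dir` is hypothetical, and the actual code in this PR may handle edge cases (non-UTF-8 names, trailing components) differently.

```rust
use std::path::Path;

/// Hypothetical helper: use the final path component as the model name.
/// Returns `None` when there is no usable final component (for example
/// `/` or a path ending in `..`) or when it is not valid UTF-8.
fn model_name_from_dir(dir: &Path) -> Option<String> {
    dir.file_name()?.to_str().map(str::to_owned)
}
```

Under this sketch, preloading from a directory such as `/tmp/models/mobilenet` would register the graph under the name `mobilenet`, which is the string a guest would then pass to `load_by_name`.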

abrown requested review from a team as code owners August 17, 2023 00:50

abrown requested review from fitzgen and pchickey and removed request for a team August 17, 2023 00:50

abrown commented Aug 17, 2023

fitzgen removed their request for review August 17, 2023 16:58

alexcrichton approved these changes Aug 22, 2023

abrown added 3 commits August 22, 2023 16:26

review: rename --graph to --wasi-nn-graph

6cff07c

abrown force-pushed the registries branch from 9207956 to 6cff07c Compare August 22, 2023 23:26

abrown enabled auto-merge August 22, 2023 23:46

abrown added this pull request to the merge queue Aug 23, 2023

Merged via the queue into bytecodealliance:main with commit 770c5d0 Aug 23, 2023

abrown deleted the registries branch August 23, 2023 00:55

abrown Aug 17, 2023

alexcrichton Aug 18, 2023

geekbeast Aug 20, 2023

abrown Aug 22, 2023

abrown Aug 23, 2023
