Hello, I'm a bit confused about how true multi-threading is achieved with Wasmtime. As far as I currently understand a potential implementation is:
Create a single Engine and use e.g. tokio to create green threads where each green thread instantiates a component.
But in that case the Engine still only really exists on a single OS thread? Should I create an Engine per OS thread?
we do have support for the wasi-threads proposal afaik -- cc @Andrew Brown for more details?
Could you clarify what you mean by "true multithreading"?
To provide some other thoughts in the meantime though:
- An Engine is intended to be one-per-process, shared across all threads.
- A Store is the one that's per-thread (sorta): a Store can only be executing on one thread at a time.
- Multithreading within a single wasm instance needs the threads proposal plus the wasi-threads proposal, both of which are experimental.
- Multithreading today can happen with concurrent execution of unrelated wasm instances in different threads (e.g. one Engine plus one Store per-instance-per-thread); see the sketch below.
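Rough sketch of that last shape (the wat module here is just a trivial placeholder, and error handling is simplified):

```rust
use wasmtime::{Engine, Instance, Module, Store};

fn main() -> wasmtime::Result<()> {
    let engine = Engine::default(); // one Engine for the whole process
    let module = Module::new(&engine, r#"(module (func (export "run")))"#)?;

    let handles: Vec<_> = (0..4)
        .map(|_| {
            // Engine and Module are cheap, Arc-style clones
            let (engine, module) = (engine.clone(), module.clone());
            std::thread::spawn(move || -> wasmtime::Result<()> {
                let mut store = Store::new(&engine, ()); // one Store per instance, per thread
                let instance = Instance::new(&mut store, &module, &[])?;
                let run = instance.get_typed_func::<(), ()>(&mut store, "run")?;
                run.call(&mut store, ()) // executes concurrently with the other threads
            })
        })
        .collect();

    for handle in handles {
        handle.join().unwrap()?;
    }
    Ok(())
}
```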
afaik, wasi-threads is used to have the guest spawn multiple threads. Instead, we would like to run multiple guests in parallel, if that makes sense?
We're not talking about multiple threads within the same wasm instance. Instead, we're talking about multiple instances running in parallel on the same engine, if that makes sense.
Yes, wasi-threads isn't required for multiple guests in parallel, and that's supported today with wasmtime. And yes, you'd have one Engine and one store per instance
er sorry, one Engine in total, and then every store-per-instance would refer to the same Engine
Do you need the "async" functionality in order to have multiple instances on the same engine?
no, you can do it with sync calls too
async in that sense is an independent decision of how to run wasm
another useful detail: an Engine is internally an Arc wrapper around the actual data, so it's meant to be shared between threads, cloned cheaply, etc
likewise Modules can be (and should be) loaded once and shared between threads
OK, still trying to figure out what async does exactly. When you have two instances calling a regular (not async) host function, does one instance have to wait for the other call to complete?
No, they can complete in parallel. You'll probably use a single Linker which provides a single definition of the host function, and the host function is defined as impl Fn(...) + Sync, which indicates it'll be called in parallel
instance-local state is provided via the Caller<'_, T> function parameter to the host function, which can be used to access the T in the Store<T>, which has per-instance state
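A rough sketch of that shape (MyState and the host/bump import names are made up here):

```rust
use wasmtime::{Caller, Engine, Linker};

struct MyState {
    hits: u64, // per-instance state lives in each Store<MyState>
}

fn main() -> wasmtime::Result<()> {
    let engine = Engine::default();
    let mut linker: Linker<MyState> = Linker::new(&engine);

    // One shared definition; concurrent calls from different instances each
    // see their own Store's MyState via the Caller.
    linker.func_wrap("host", "bump", |mut caller: Caller<'_, MyState>| {
        caller.data_mut().hits += 1;
    })?;

    // ... then instantiate each module with `linker.instantiate(&mut store, &module)`
    Ok(())
}
```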
async is required if your host functions are async, for example; otherwise you probably don't need async
time-slicing between different instances?
yeah for example if you want to prevent an infinite loop from hogging time from other instances
So if I have a PC with 4 cores, and 5 instances with an infinite loop, only 4 will actually execute at all times?
it depends, if those 5 instances are in 5 OS threads then no, the OS will time-slice between threads
if those 5 instances are on 4 OS threads then yes, those 4 OS threads will be locked up by the infinite loops
you can set up something like epochs/fuel to time out the wasm instances, but otherwise the OS threads will be locked up while the infinite loop is executing
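e.g. a minimal sketch with fuel, where the looping module is just a stand-in for a runaway guest:

```rust
use wasmtime::{Config, Engine, Instance, Module, Store};

fn main() -> wasmtime::Result<()> {
    let mut config = Config::new();
    config.consume_fuel(true);
    let engine = Engine::new(&config)?;
    let module = Module::new(&engine, r#"(module (func (export "spin") (loop (br 0))))"#)?;

    let mut store = Store::new(&engine, ());
    store.set_fuel(1_000_000)?; // the guest traps once this runs out

    let instance = Instance::new(&mut store, &module, &[])?;
    let spin = instance.get_typed_func::<(), ()>(&mut store, "spin")?;
    assert!(spin.call(&mut store, ()).is_err()); // out-of-fuel trap instead of a thread stuck forever
    Ok(())
}
```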
Does the instance run in the OS thread where it is started? Or is there a separate thread where the engine + instances run? I was confused by the explanation about the engine being "shared" between threads.
in the OS thread it was started on
Wasmtime doesn't spawn any threads for wasm execution, that's up to the embedder to configure
sharing the engine is done by the embedder as well by having it as part of the closed-over-state for new threads
I think I'm starting to get it now. What is closed-over-state?
Something like:
let engine = wasmtime::Engine::default(); // or however you configure it
std::thread::spawn(move || {
    foo(&engine); // the outer closure has `engine` in its closed-over-state
});
So if e.g. I use tokio to spawn instances, they could, if I'm not careful, spawn on the same thread as my actual executor, blocking new instances from spawning?
correct, if using tokio it's recommended to enable async support (since your host functions are probably async) and to additionally enable epochs plus epoch_deadline_async_yield_and_update to enable time-slicing and avoid blocking the executor
that forces wasm to "yield" every epoch interval, which provides the ability to cancel wasm (e.g. time it out) or otherwise just let other stuff run if there's more
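roughly, the setup side of that looks like this (the 10ms tick and the dedicated ticker thread are just one way to drive the epoch; the embedder decides how):

```rust
use std::time::Duration;
use wasmtime::{Config, Engine, Store};

fn main() -> wasmtime::Result<()> {
    let mut config = Config::new();
    config.async_support(true);
    config.epoch_interruption(true);
    let engine = Engine::new(&config)?;

    // Something in the embedder has to bump the epoch periodically.
    let ticker = engine.clone();
    std::thread::spawn(move || loop {
        std::thread::sleep(Duration::from_millis(10));
        ticker.increment_epoch();
    });

    let mut store = Store::new(&engine, ());
    store.set_epoch_deadline(1);
    // On each deadline, yield back to the executor, then run for one more epoch.
    store.epoch_deadline_async_yield_and_update(1);

    // ... instantiate and call exports with `call_async` from the tokio runtime
    Ok(())
}
```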
(just as a point of reference, this isn't really a problem unique to Wasmtime: any system that calls guest code from an executor has to worry about that code blocking the executor loop)
Ok, we're seeing this behavior with sleep too. Is that because, when a guest calls "sleep", it sleeps the entire OS thread? Or is it simply that the OS thread is locked by the executor doing something like "poll()" in that thread?
yes, it'd be the same as native Rust code called by an async executor calling the system sleep(). Runtimes like tokio provide alternatives for things that would usually block, that integrate with the executor to yield instead
(including sleeps, async versions of IO, async versions of mutexes and queues, that sort of thing)
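for example, a host-provided sleep defined as an async function yields to tokio instead of parking the OS thread (the host/sleep_ms import names here are made up):

```rust
use std::time::Duration;
use wasmtime::{Caller, Config, Engine, Linker};

fn main() -> wasmtime::Result<()> {
    let mut config = Config::new();
    config.async_support(true);
    let engine = Engine::new(&config)?;
    let mut linker: Linker<()> = Linker::new(&engine);

    // An async host function: awaiting tokio's sleep yields back to the executor
    // instead of blocking the whole OS thread like std::thread::sleep would.
    linker.func_wrap_async("host", "sleep_ms", |_caller: Caller<'_, ()>, (ms,): (u64,)| {
        Box::new(async move {
            tokio::time::sleep(Duration::from_millis(ms)).await;
        })
    })?;

    // ... instantiate with the linker and drive exports with `call_async`
    Ok(())
}
```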
Does this mean that the fuel concept doesn't help trying to unblock threads with sleeping guests?
if you call into the kernel and ask for the entire thread to sleep, there's nothing we can do to stop that
Ok, I didn't realize sleep in the guest goes straight to the host kernel.
iiuc, the wasi impl of sleep will yield when using the async version of wasi
ah, unclear, sorry, I had thought you meant you had a hostcall that directly called sleep
ok, thanks for the clarification!
Hopefully, my final question: we'd like to launch X instances on X new OS threads, so the total number of threads is X+1 (thread launching other threads). Is this possible with Tokio?
One option is to use std::thread::spawn, which guarantees a thread is spawned. Another is to use spawn_blocking, but that won't guarantee a thread is spawned (e.g. the thread pool may already be full)
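rough sketch of the difference (run_instance is a stand-in for the per-instance Store/Instance work):

```rust
// Stand-in for creating a Store, instantiating a module, and calling the guest.
fn run_instance() { /* ... */ }

#[tokio::main]
async fn main() {
    // Always a brand-new OS thread, independent of tokio's pools.
    let dedicated = std::thread::spawn(run_instance);

    // Runs on tokio's blocking pool; it may reuse an existing thread, or queue
    // if the pool is already at its limit.
    let pooled = tokio::task::spawn_blocking(run_instance);

    pooled.await.unwrap();
    dedicated.join().unwrap();
}
```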
Great, thanks a lot!
Merlijn Sebrechts said:
Ok, we're seeing this behavior with sleep too. Is that because, when a guest calls "sleep", it sleeps the entire OS thread? Or is it simply that the OS thread is locked by the executor doing something like "poll()" in that thread?
that is true for the synchronous implementation of wasi preview 1 (wasi-cap-std-sync). for the async impl of wasi preview 1, and all impls of preview 2, it's actually just a tokio yield for the duration of the sleep
Ah, makes sense, thanks!