wasm validation inside of cranelift? · cranelift

I've got a question for folks who manage cranelift-in-spidermonkey, currently cranelift does no validation of the input wasm module assuming that the embedder does the actual validation (e.g. wasmtime calls wasmparser::validate and presumably spidermonkey validates elsewhere). I think though this may not be tenable too much further into the future.

Alex Crichton (Jul 13 2020 at 17:57):

One example is that ref.is_null no longer encodes its type in the instruction, so to figure out whether you're testing externref or funcref you need to look at the type stack

Alex Crichton (Jul 13 2020 at 17:57):

wasmtime gets away with this by using a different representation so we can look at the type of the cranelift value, but there may be instructions in the future which need non-local information about how to codegen something

Alex Crichton (Jul 13 2020 at 17:58):

Additionally validation has a nontrivial cost in wasmtime right now because we simply "run it up front" and throw away all the results, so for Wasmtime specifically it'd be best actually if the module was validated during translation (this would also make validation parallel, which it currently isn't in wasmtime)

Alex Crichton (Jul 13 2020 at 17:58):

so all of that to ask, what would folks think about adding validation interleaved with cranelift-wasm's translation?

Alex Crichton (Jul 13 2020 at 17:59):

Is this a cost specifically avoided in the spidermonkey integration? For example do we need to make validation a compile time or runtime switch for cranelift? Or would spidermonkey be ok if cranelift double-validated

Alex Crichton (Jul 13 2020 at 17:59):

Chris Fallin (Jul 13 2020 at 18:12):

AFAICT, the wasm module has already been parsed by other bits of SM's wasm frontend by the time that Cranelift is invoked to compile functions; there are a bunch of shims to move the types/signatures over that I haven't internalized. So validation that happens during wasm parsing in Cranelift would be an extra cost; but intuitively it seems to me that it might not be too much extra cost if the work is essentially shared by parsing and validation

Chris Fallin (Jul 13 2020 at 18:13):

My instinct at first, at least, is to say that it's better for CL to remain simple, rather than trying to make all validation conditional. If it's too much latency later, we can definitely come back and refactor. But @Benjamin Bouvier probably has better thoughts on this! (AFAIK he's out today and tomorrow)

Alex Crichton (Jul 13 2020 at 18:14):

That's a good point that I'm also not familiar with the spidermonkey integration, I'm assuming it just hands a blob of bytes to cranelift but I could be wrong

Alex Crichton (Jul 13 2020 at 18:14):

And yeah doing this conditionally would be tricky, mainly because I'm not sure how to conditionalize the translation phase where we actually need the type information

Alex Crichton (Jul 13 2020 at 18:14):

well, eventually would probably need the type information, today we're getting by ok

Chris Fallin (Jul 13 2020 at 18:17):

Chris Fallin (Jul 13 2020 at 18:19):

I'd need to read/think a bit more before saying what would make the most sense long-term... we definitely still need to parse for baseline compilation, before Cranelift is involved... but for now it seems parsing costs would be dwarfed by the other compilation stages so it doesn't seem like a big issue to me!

Alex Crichton (Jul 13 2020 at 18:19):

Alex Crichton (Jul 13 2020 at 18:20):

if cranelift-wasm changes in ways that spidermonkey doesn't even touch that's presumably A-OK heh

Chris Fallin (Jul 13 2020 at 18:20):

Benjamin Bouvier (Jul 15 2020 at 10:41):

Benjamin Bouvier (Jul 15 2020 at 10:42):

My thought is that ideally, we'd have a way to provide cranelift-wasm with module-level information (type section info, etc.), while cranelift-wasm would do the function validation.

Benjamin Bouvier (Jul 15 2020 at 10:42):

(Spidermonkey's Wasm.validate is super fast, but that's still an extra 100ms overhead for very large modules that only Cranelift has)

Benjamin Bouvier (Jul 15 2020 at 10:43):

(also Cranelift-in-SM does validate function's bodies in parallel, so the cost is a bit amortized)

Alex Crichton (Jul 15 2020 at 13:46):

That actually sounds perfect, and sounds like my conclusions reading code yesterday were indeed correct! I'll see if I can work on wasmparser to enable this (wasmparser basically already has support so it should be easy)

Alex Crichton (Jul 21 2020 at 22:45):

Validate modules while translating by alexcrichton · Pull Request #2059 · bytecodealliance/wasmtime

This commit is a change to cranelift-wasm to validate each function body as it is translated. Additionally top-level module translation functions will perform module validation. This commit builds ...

Yury Delendik (Jul 21 2020 at 22:46):

Alex Crichton (Jul 21 2020 at 22:46):

that PR now forcibly validates all wasm functions fed to cranelift-wasm, so this will affect Gecko (if that lands as-is). I'd definitely want folks familiar with the spidermonkey integration of Cranelift to take a look!

Alex Crichton (Jul 21 2020 at 22:47):

the knobs are now there for validation to be entirely deferred to cranelift (spidermonkey can provide its own hooks to learn about the module's other types), but that may not be sufficient. If so just let me know!

Benjamin Bouvier (Jul 22 2020 at 08:18):

Looks great! I'll be happy to integrate and test it in spidermonkey when the time comes !

Benjamin Bouvier (Jul 22 2020 at 08:21):

Benjamin Bouvier (Jul 24 2020 at 13:38):

Validate modules while translating by alexcrichton · Pull Request #2059 · bytecodealliance/wasmtime

Benjamin Bouvier (Jul 24 2020 at 14:03):

Oh i actually had another question about this: right now, Baldrdash will request module-level information lazily, so it'd make sense that we'd now implement a cache for this information, that we'd fill in incrementally as we compile the function (and share this information with other function compilations, since at this point the module-level info is immutable). Since the traits mention &self and not &mut self, I'd need to use interior mutability (RefCell) to be able to build up the cache information; are there other better ways that would avoid the dynamic RefCell checks on borrows?

Alex Crichton (Jul 28 2020 at 14:06):

@Benjamin Bouvier hey back now, thanks for taking a look! Is Self::Type the only thing you had an issue with?

Alex Crichton (Jul 28 2020 at 14:07):

If so I think I might just remove that trait and return wasmparser::Type instead since I'm not sure having an associated type for Type is all that worth it (all you can do is get wasmparser::Type anyway)

Benjamin Bouvier (Jul 28 2020 at 14:07):

also self vs mut self, but it's probably more open to discussion (and i can solve it locally with a RefCell iiuc, with an additional dynamic cost unfortunately)

Alex Crichton (Jul 28 2020 at 14:09):

Alex Crichton (Jul 28 2020 at 14:10):

Alex Crichton (Jul 28 2020 at 14:12):

Benjamin Bouvier (Jul 28 2020 at 14:13):

Alex Crichton (Jul 28 2020 at 14:14):

Benjamin Bouvier (Jul 28 2020 at 14:16):

Ok! Is the OnceCell::get_mut method cheaper to call than a RefCell::borrow_mut ? (I imagine yes, but not sure)

Alex Crichton (Jul 28 2020 at 14:17):

Benjamin Bouvier (Jul 28 2020 at 14:21):

Ok! Since the cache is filled up as the trait methods are called, i'm not sure it's going to need a long-lived borrow, but yeah, both would work. So it's not a strong requirement for Spidermonkey.

Benjamin Bouvier (Jul 28 2020 at 14:21):

Thanks for checking this! having just return wasmparser::Type would indeed be ideal

Alex Crichton (Jul 28 2020 at 14:26):

Benjamin Bouvier (Jul 28 2020 at 14:28):

Alex Crichton (Jul 28 2020 at 14:33):

Benjamin Bouvier (Jul 28 2020 at 14:34):

ah right, i was wondering if this would be future proof since the GC type proposal might introduced structured types that won't be primitive types

Benjamin Bouvier (Jul 28 2020 at 14:35):

Alex Crichton (Jul 28 2020 at 14:37):

Alex Crichton (Jul 28 2020 at 16:35):

Simplify the `WasmModuleResources` trait by alexcrichton · Pull Request #68 · bytecodealliance/wasm-tools

Don't have separate traits for tables/memories/globals when the wasmparser Copy types by value work for now. This may need to be extended later but we can presumably cross that bridge when we g...

Benjamin Bouvier (Jul 28 2020 at 16:39):

@Alex Crichton thanks! I'm worried about the reverting for TableType and other non-primitive types, though; this will require straight conversions from spidermonkey's types to wasmparser's types everywhere, or creating the converted types and caching them...

Alex Crichton (Jul 28 2020 at 16:39):

I think it should be cheap to do though? They're basically just Copy containers of types

Stream: cranelift

Topic: wasm validation inside of cranelift?

Alex Crichton (Jul 13 2020 at 17:56):

Alex Crichton (Jul 13 2020 at 17:57):

Alex Crichton (Jul 13 2020 at 17:57):

Alex Crichton (Jul 13 2020 at 17:58):

Alex Crichton (Jul 13 2020 at 17:58):

Alex Crichton (Jul 13 2020 at 17:59):

Alex Crichton (Jul 13 2020 at 17:59):

Chris Fallin (Jul 13 2020 at 18:12):

Chris Fallin (Jul 13 2020 at 18:13):

Alex Crichton (Jul 13 2020 at 18:14):

Alex Crichton (Jul 13 2020 at 18:14):

Alex Crichton (Jul 13 2020 at 18:14):

Chris Fallin (Jul 13 2020 at 18:17):

Chris Fallin (Jul 13 2020 at 18:19):

Alex Crichton (Jul 13 2020 at 18:19):

Alex Crichton (Jul 13 2020 at 18:20):

Chris Fallin (Jul 13 2020 at 18:20):

Benjamin Bouvier (Jul 15 2020 at 10:41):

Benjamin Bouvier (Jul 15 2020 at 10:42):

Benjamin Bouvier (Jul 15 2020 at 10:42):

Benjamin Bouvier (Jul 15 2020 at 10:43):

Alex Crichton (Jul 15 2020 at 13:46):

Alex Crichton (Jul 21 2020 at 22:45):

Yury Delendik (Jul 21 2020 at 22:46):

Alex Crichton (Jul 21 2020 at 22:46):

Alex Crichton (Jul 21 2020 at 22:47):

Benjamin Bouvier (Jul 22 2020 at 08:18):

Benjamin Bouvier (Jul 22 2020 at 08:21):

Benjamin Bouvier (Jul 24 2020 at 13:38):

Benjamin Bouvier (Jul 24 2020 at 14:03):

Alex Crichton (Jul 28 2020 at 14:06):

Alex Crichton (Jul 28 2020 at 14:07):

Benjamin Bouvier (Jul 28 2020 at 14:07):

Alex Crichton (Jul 28 2020 at 14:09):

Alex Crichton (Jul 28 2020 at 14:10):

Alex Crichton (Jul 28 2020 at 14:10):

Alex Crichton (Jul 28 2020 at 14:10):

Alex Crichton (Jul 28 2020 at 14:10):

Alex Crichton (Jul 28 2020 at 14:12):

Alex Crichton (Jul 28 2020 at 14:12):

Benjamin Bouvier (Jul 28 2020 at 14:13):

Alex Crichton (Jul 28 2020 at 14:14):

Benjamin Bouvier (Jul 28 2020 at 14:16):

Alex Crichton (Jul 28 2020 at 14:17):

Alex Crichton (Jul 28 2020 at 14:17):

Benjamin Bouvier (Jul 28 2020 at 14:21):

Benjamin Bouvier (Jul 28 2020 at 14:21):

Alex Crichton (Jul 28 2020 at 14:26):

Benjamin Bouvier (Jul 28 2020 at 14:28):

Alex Crichton (Jul 28 2020 at 14:33):

Alex Crichton (Jul 28 2020 at 14:33):

Benjamin Bouvier (Jul 28 2020 at 14:34):

Benjamin Bouvier (Jul 28 2020 at 14:35):

Alex Crichton (Jul 28 2020 at 14:37):

Alex Crichton (Jul 28 2020 at 14:37):

Alex Crichton (Jul 28 2020 at 16:35):

Benjamin Bouvier (Jul 28 2020 at 16:39):

Alex Crichton (Jul 28 2020 at 16:39):

Alex Crichton (Jul 28 2020 at 16:39):

Alex Crichton (Jul 28 2020 at 16:40):

Benjamin Bouvier (Jul 28 2020 at 16:42):