Stream: git-wasmtime

Topic: wasmtime / issue #4414 implement wasmtime::component::fla...


view this post on Zulip Wasmtime GitHub notifications bot (Jul 07 2022 at 23:34):

github-actions[bot] commented on issue #4414:

Subscribe to Label Action

cc @peterhuene

<details>
This issue or pull request has been labeled: "wasmtime:api"

Thus the following users have been cc'd because of the following labels:

To subscribe or unsubscribe from this label, edit the <code>.github/subscribe-to-label.json</code> configuration file.

Learn more.
</details>

view this post on Zulip Wasmtime GitHub notifications bot (Jul 12 2022 at 01:12):

jameysharp commented on issue #4414:

First, some quick things before the complicated notes:

So a macro invocation might look like:

flags! {
    FooSet => enum Foo {
        #[component(name = "foo-bar-baz")]
        A,
        B,
    }
}

Now: After a lot of thought, I suspect that implementing flags can be quite a bit simpler than this, given a slightly different perspective.

The first observation is that WebAssembly specifies that multi-byte values are stored in linear memory in little-endian order. So even though the canonical ABI specifies flags using 8, 16, or 32-bit quantities, all flags types can be treated uniformly as byte-arrays in load and store. Flag indexing works out the same way regardless. That in turn means we can implement those methods as pure memory copies, as long as we define the Rust types as minimum-length byte arrays. (For anything more than 2 bytes, store also should write zeros in the last -n&3 bytes.)

The next observation is that wasmtime's ValRaw type, used when values are lowered to or lifted from the wasm stack, doesn't distinguish types smaller than 32 bits. So if the Rust-side definition of a flags type is a byte array, then no matter how many flags there are, it's correct to use u32::from_le_bytes or u32::to_le_bytes to map every four-byte chunk into a ValRaw, padding the last chunk with zeros if necessary.

I'm wondering if we could go as far as having a single Flags type provided in the wasmtime crate, with type parameters for the array of ValRaw used as its ComponentType::Lower type, and for the array of bytes used as its Rust representation. An additional type parameter could refer to a type with a type-level constant containing the list of flag names, for use during type-checking and in Debug. (It'd be nice to compute some type parameters from others, but as far as I can tell Rust can't currently do that. I'd list what I tried but this note is already too long.)

Then all of ComponentType, Lift, Lower, Eq, Debug, BitOr, etc can be implemented once for the generic Flags type. The flags macro just needs to emit a type alias to Flags with the appropriate type parameters filled in. I think at that point it might be simple enough that we can write it as a macro_rules macro.

Consistently using byte arrays means there's nothing special about the 1-byte or 2-byte encodings when deciding what mask to use for the last few flags. If number of flags n mod 8 is 0, then no mask is necessary; otherwise, the last byte should be masked with !(0xFFu8 << (n % 8)). Any function that needs to know the number of flags can get it from the length of the list of names, available from the third type-parameter on Flags.

I believe you can also write impls for specific instantiations of Flags to declare things like { const A: FlagType = ...; } without running afoul of Rust's orphaned impl rules. That said, I think asking the user of the flags! macro to provide both an enum Foo type name and a FooSet type name will work out better.

This was a lot, sorry. I hope it made some sense and I want to hear what you think of it; I'm not certain it's better, I just think it has good odds of working out well.

view this post on Zulip Wasmtime GitHub notifications bot (Jul 12 2022 at 13:11):

alexcrichton commented on issue #4414:

Personally I don't feel the same way about Foo and FooSet. The flags type in the component model I see as basically analagous to the FooSet idea so I'm not sure if it also justifies the existence of Foo. That sort of feels like too much boilerplate to me. That being said I don't really feel too strongly.

I'd also caution to avoid going too crazy with the representation just yet. I don't feel like we've got a real handle on what performance pitfalls are, how this will all be used, and how it all integrates into a final experience. In that sense I think it might be premature to sink too much design effort into a small niche of the component model when it's not clear if it'll pay off or be worth it in the end.

view this post on Zulip Wasmtime GitHub notifications bot (Jul 12 2022 at 14:11):

dicej commented on issue #4414:

@jameysharp You've clearly put a lot more thought into this than I have, and I think you've got some great ideas. I don't use bit flags much in day-to-day application-level code, so it's not a topic I have any strong opinions about.

My overall goal is to get the component model implemented in wasmtime and other key projects ASAP, so I'm biased in favor of going with what we have for now (possibly with some tweaks where appropriate, such as always deriving both Lift and Lower) and refining it later once we have some real world experience, like Alex suggests.

I'll go ahead now and remove the derive syntax from the macro, since that's easy.

view this post on Zulip Wasmtime GitHub notifications bot (Jul 12 2022 at 17:00):

jameysharp commented on issue #4414:

Quick detail first: I don't think the mask trick I described works unless you guard against the case where len%bits==0. In that case you could avoid masking entirely, or there are several other ways to do it, but without bigger changes to code structure it's a little tricky. Perhaps there should be tests for exactly 8/16/32 flags, as I believe the current version will panic in debug builds for those cases. Feel free to go back to the loops if you don't want to think about it.

Okay, I can see I dove into technical details without making my motivations clear. I was trying to address two high-level concerns: one is API design, and the other is ease of maintenance of the implementation.

At the moment I'm mostly concerned about implementation complexity. Procedural macros are complicated. I don't see any way around using them for #[deriving]. But this PR adds half as much code to component-macro to support flags as that crate has for all the other cases put together. Since we aren't using #[deriving] for flags, I think we can put most of the complexity in library code instead, and either avoid proc-macros entirely or at least add very little code there.

Even if we stick entirely with proc-macros, much of the new code is due to duplication around the size 1/2/4+ split, as well as trying to reuse the deriving implementation for records. My motivation for suggesting byte arrays is that it eliminates a bunch of special cases, as well as minimizing the amount of generated code since you can just loop over the flags instead.

I also want to be clear that I hadn't come up with an alternative until I spent time understanding what you've done in this PR and why. I think the effort you put into this was important for moving component model implementation forward, regardless of whether we merge it as-is—which we may do.

Finally, I don't mind dropping the API design questions. Yes, we should bias toward getting something merged. The syntax in the flags! macro isn't particularly important because we expect it to be auto-generated by wit-bindgen, not maintained by hand. I'm a little more concerned about API design for how the host can interact with these flag-sets, but sure, we're not promising any stable API here so we can futz with it later.

So with all that said: @dicej, please fix the mask generation; and @alexcrichton, I'd appreciate your take on implementation complexity.

view this post on Zulip Wasmtime GitHub notifications bot (Jul 12 2022 at 17:47):

alexcrichton commented on issue #4414:

Personally I agree that we don't want the implementation complexity too high right now in the abstract. The component model is still very early-days and we should be focusing primarily on correctness rather than speed for now (until we get to the point where we have a concrete thing to benchmark). Having skimmed this over this seems fine to me complexity-wise, but I'm also coming from the other 10k+ lines of code related to the component model elsewhere so my calibration is not always the same as others'

view this post on Zulip Wasmtime GitHub notifications bot (Jul 12 2022 at 23:46):

jameysharp commented on issue #4414:

Yeah, you're both right: I'll go ahead and merge this. I might go try implementing my proposal afterward, but yes, it's better to merge this than not.

Also, thanks for the comprehensive mask fix and additional tests, @dicej! This is all strong work, like your earlier PRs.


Last updated: Jan 24 2025 at 00:11 UTC