abrown commented on issue #4778:
There are a couple of issues to resolve before merging this:
- someone (@alexcrichton?) had mentioned previously that it would be good to expand the number of V128 known values so that we can find bugs faster. There are a lot of known values that could be interesting, though: 6 kinds of V128 (i8x16, i16x8, ..., f64x2) x up to 8 values (NaN, Inf, -Inf, ..., 1, MAX). Any suggestions on a nice way to generate all of these?
- when run on this branch, the `differential` fuzz target will eventually find NaN canonicalization issues for the V128 values--e.g., the spec interpreter will canonicalize them but the single-instruction generator does not know how to do this. I'm wondering how best to tackle this:
  a. try to add canonicalization to single-inst modules to match wasm-smith; I haven't looked into this much... does wasm-smith even canonicalize V128s?
  b. try to teach the `impl PartialEq for DiffValue` to ignore NaNs for V128s; this already happens for F32 and F64, but for V128 this is trickier because we don't know how to split up the lanes (is it an F32x4? No, an I8x16?) and there is actually no easy way to know this from the WebAssembly function signature itself--the information is lost. But perhaps we could first try `==`, then `F32x4` NaNs, then `F64x2` NaNs, then fail (something like that).

Thoughts on these two questions?
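On the first bullet above (enumerating interesting V128 known values), one possible approach is to splat each interesting per-lane value across all lanes of the chosen shape. This is only an editorial sketch of that idea, not code from this PR; the lane lists and the `splat` helper are illustrative:

```rust
// Sketch: build "known interesting" V128 constants by splatting a single
// interesting lane value across every lane of a given lane width.
fn splat(lane_bits: u128, lane_width: u32) -> u128 {
    (0..128 / lane_width).fold(0u128, |acc, i| acc | (lane_bits << (i * lane_width)))
}

fn main() {
    // Interesting f32 lane patterns: NaN, +Inf, -Inf, 0, -0, 1, MAX, MIN.
    let f32_lanes = [
        f32::NAN, f32::INFINITY, f32::NEG_INFINITY,
        0.0, -0.0, 1.0, f32::MAX, f32::MIN,
    ];
    let f32x4_known: Vec<u128> = f32_lanes
        .iter()
        .map(|f| splat(f.to_bits() as u128, 32))
        .collect();

    // Interesting i16 lane patterns: 0, 1, -1, MIN, MAX; other integer lane
    // shapes (i8x16, i32x4, i64x2) would follow the same pattern.
    let i16_lanes = [0i16, 1, -1, i16::MIN, i16::MAX];
    let i16x8_known: Vec<u128> = i16_lanes
        .iter()
        .map(|i| splat(*i as u16 as u128, 16))
        .collect();

    println!(
        "{} f32x4 + {} i16x8 known values",
        f32x4_known.len(),
        i16x8_known.len()
    );
}
```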
jameysharp commented on issue #4778:
> There are a couple of issues to resolve before merging this:
> - someone (@alexcrichton?) had mentioned previously that it would be good to expand the number of V128 known values so that we can find bugs faster. There are a lot of known values that could be interesting, though: 6 kinds of V128 (i8x16, i16x8, ..., f64x2) x up to 8 values (NaN, Inf, -Inf, ..., 1, MAX). Any suggestions on a nice way to generate all of these?

To make sure I'm looking at the right thing: you're talking about `DiffValue::arbitrary_of_type` in `crates/fuzzing/src/generators/value.rs`, right?

I'm thinking that `arbitrary_of_type` could first pick a vector kind (f32x4 etc.), then call itself once for each lane, requesting the right lane type. For example, 4 calls to `DiffValue::arbitrary_of_type(u, F32)`, then pack the bits from each call into a `u128`. Is that too simplistic? I guess it doesn't quite work for vectors with lanes narrower than 32 bits.
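As a rough illustration of the lane-packing idea (an editorial sketch, not the fuzzer's actual code; the fixed lane values below stand in for whatever `DiffValue::arbitrary_of_type(u, F32)` would return for each lane):

```rust
// Sketch: generate each f32 lane separately, then pack the four 32-bit
// patterns into one u128, with lane 0 in the low bits.
fn pack_f32x4(lanes: [f32; 4]) -> u128 {
    lanes
        .iter()
        .enumerate()
        .fold(0u128, |bits, (i, lane)| {
            bits | ((lane.to_bits() as u128) << (32 * i))
        })
}

fn main() {
    // Fixed values here; the real generator would draw each lane arbitrarily.
    let v128 = pack_f32x4([1.0, f32::NAN, f32::INFINITY, -0.0]);
    println!("{v128:#034x}");
}
```

Lanes narrower than 32 bits would need their own packing step, which is the wrinkle noted above.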
> - when run on this branch, the `differential` fuzz target will eventually find NaN canonicalization issues for the V128 values--e.g., the spec interpreter will canonicalize them but the single-instruction generator does not know how to do this. I'm wondering how best to tackle this:

The cranelift-fuzzgen target turns on a "nan_canonicalization" pass inside Cranelift. I see there's a Wasmtime config option, `config.cranelift_nan_canonicalization(true)`, that should do the same thing. Does that help here?
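For reference, that Wasmtime option is set on a `Config` before building the engine; a minimal sketch (it only affects code compiled by Wasmtime itself):

```rust
use wasmtime::{Config, Engine};

fn main() {
    // Ask Cranelift to rewrite every NaN produced by compiled code into the
    // canonical NaN bit pattern before it becomes observable.
    let mut config = Config::new();
    config.cranelift_nan_canonicalization(true);
    let _engine = Engine::new(&config).expect("failed to create engine");
}
```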
afonso360 commented on issue #4778:
> The cranelift-fuzzgen target turns on a "nan_canonicalization" pass inside Cranelift. I see there's a Wasmtime config option, `config.cranelift_nan_canonicalization(true)`, that should do the same thing. Does that help here?

If I understand this correctly, this fuzz target can fuzz against engines other than Wasmtime, right? They might not have an equivalent NaN canonicalization pass, so we would have to run an equivalent pass in wasm before sending the wasm code to the engines.
fitzgen commented on issue #4778:
wasm-smith
has an option to canonicalize NaNs today, not sure if it is complete for SIMD stuff or not tho: https://github.com/bytecodealliance/wasm-tools/blob/de47b042d93a6e28208e9a83c10304b3d64db625/crates/wasm-smith/src/core/code_builder.rs#L867
abrown commented on issue #4778:
Yeah, maybe we could update which instructions can have their outputs canonicalized in wasm-smith (thanks @fitzgen for the link, was wondering about that), but the other big issue is that we would have to build similar canonicalization infrastructure in the single-instruction module generator for all the same instructions.

I'm actually thinking through option B above, which is to implement fail-overs in `PartialEq for DiffValue`--why? The first N bits of an FP NaN are set: for f32, it's the first 10 bits (`0b0_11111111_1`... sign, exponent, first bit of mantissa)<sup>1</sup>. If we compare two V128s and they do not match because a single F32 lane contains a NaN with a differing payload, then the bits of those V128s can differ by up to 22 bits / 128 bits = ~17%. If in fact the difference in the V128s was not due to NaN canonicalization, would we really want to take that 17% risk of ignoring a bug?

So now I'm leaning towards canonicalizing certain instructions in the single-instruction generator, as done here.

<a name="nan">#1</a>: as a reminder, after the first `1` bit in the mantissa, canonical NaNs are filled with 0s whereas arithmetic NaNs (which are the problem here) can contain any arbitrary bits. Also see the Wasm spec.
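A tiny standalone check of that bit arithmetic (an editorial aside, not code from the PR):

```rust
// For an f32 NaN the sign, the 8 exponent bits, and the top mantissa bit are
// pinned (10 bits), leaving 22 payload bits free; one NaN lane can therefore
// flip up to 22 of a v128's 128 bits, roughly 17%.
fn main() {
    let canonical_f32_nan: u32 = 0x7fc0_0000; // exponent all ones, mantissa MSB set
    assert!(f32::from_bits(canonical_f32_nan).is_nan());

    let free_payload_bits = 32 - 10;
    println!(
        "{} free bits / 128 = {:.1}%",
        free_payload_bits,
        free_payload_bits as f64 / 128.0 * 100.0
    );
}
```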
jameysharp commented on issue #4778:
Let me see if I have this right: a wasm-level NaN canonicalization pass would append a type-specific fixed sequence of instructions immediately after every instruction that pushes a newly-computed floating-point value on the stack, and you can tell which sequence to use by looking only at the instruction, without any surrounding context.
That does sound better than ignoring possible-NaN differences at the end, after all type information is lost.
abrown commented on issue #4778:
Yeah, that's right.
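For concreteness, here is one way such a per-lane sequence could look for an instruction that produces an f32x4 result. This is an editorial sketch, not the exact code in wasm-smith or the single-inst generator; it assumes a scratch `v128` local and is written as WAT inside a Rust string for readability:

```rust
// Sketch of a wasm-level canonicalization sequence for an f32x4 result:
// replace the bits of NaN lanes with the canonical NaN pattern and leave
// every other lane untouched.
fn main() {
    let wasm = wat::parse_str(
        r#"
        (module
          (func (param v128) (result v128)
            (local $t v128)
            local.get 0          ;; stand-in for "the instruction under test just pushed a v128"
            local.set $t
            ;; v1: the canonical f32 NaN bit pattern splatted into every lane
            v128.const i32x4 0x7fc00000 0x7fc00000 0x7fc00000 0x7fc00000
            ;; v2: the original value
            local.get $t
            ;; c: per-lane all-ones mask where x != x, i.e. the NaN lanes
            local.get $t
            local.get $t
            f32x4.ne
            ;; (v1 & c) | (v2 & ~c): canonical NaN in NaN lanes, original bits elsewhere
            v128.bitselect))
        "#,
    )
    .expect("valid wat");
    println!("{} bytes of wasm", wasm.len());
}
```

An f64x2 version would follow the same shape with `f64x2.ne` and the 64-bit canonical NaN pattern.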
alexcrichton commented on issue #4778:
I think it's ok to expand the set of interesting v128 values later, and I think it's ok to be a bit verbose and just list them all out rather than procedurally generating them (basically I don't have a better idea).
Otherwise, though, with the NaN issues I think the best solution is to add code to `wasm-smith` to do the same canonicalization that happens for f32 and f64. I think the sequences of instructions are basically going to be the same, just for vectors instead.
abrown commented on issue #4778:
> Seems like a good start to me! I personally think that the nan canonicalization code would go better in wasm-smith itself along the lines of f32/f64 canonicalization that's already in there. I'm ok landing this first though and updating that afterwards.

I don't get what you mean. How would that work?
alexcrichton commented on issue #4778:
Wasm-smith-based canonicalization is currently here. Actually, reading over that, it looks like there are already affordances for SIMD-based float operations. If your differential fuzzing here found issues with NaNs, though, then it may actually just be a case of extending the opcode listings there to canonicalize on a few more operations.
abrown commented on issue #4778:
Ah, no, the NaN canonicalization problem I observed was with a `SingleInstModule`, which `wasm-smith` does not generate. So my solution was to bring that same NaN canonicalization logic over from `wasm-smith` to `SingleInstModule`. For now, I have not observed a scalar failure, though it seems possible that this could happen one day (which is why I ported the scalar canonicalization as well); when it does, we can enable the same scalar instructions that you pointed to. So only SIMD canonicalization is actually enabled in c6bb6c0.

When I run this locally with V8 disabled (I still haven't figured out what is going on with the segfaults there), I have not seen a failure yet:
```
$ ALLOWED_ENGINES=-v8 cargo +nightly fuzz run -s none differential
...
=== Execution rate (1152302 successes / 2313000 attempted modules): 49.82% ===
wasmi: 8.06%, spec: 8.46%, wasmtime: 83.48%, v8: 0.00%
wasm-smith: 72.32%, single-inst: 27.68%
```
I guess I'll merge it and we can see if anything is found by oss-fuzz.
alexcrichton commented on issue #4778:
Oh wow sorry I completely forgot that this isn't wasm-smith related at all. Unsure how I missed that for so long... Disregard me and this is definitely good to land :+1: