Does cranelift have a concept of UB/poison like LLVM does?
Nope, we've actually pretty explicitly chosen not to have UB
I understand wasm requires things like division by zero to trap. But splitting the check for zero divisor and division into separate instructions in the IR could expose more optimisation opportunites. Would just have to modify the fronted to generate code that checks for UB and traps
small asterisk around memory accesses -- if you access an invalid pointer, all bets are off
There are a whole host of benefits we get by not having UB, and I've tried to hold the line on this as much as possible: verification is simpler and subtle correctness bugs from unexpected optimization assumptions just can't happen
It's true that in some cases it will limit opportunities, but our goal is explicitly not to be LLVM
(for a somewhat related discussion, see recent comments about pointer provenance on GitHub -- I argued to treat a pointer as an integer, and define an access to a pointer as valid as long as the integer value is valid, irrespective of its provenance; because the alternative, where certain kinds of derivations are UB, just causes so many broken assumptions that it's impossible to be really sure about optimization correctness)
Fuzzing and other similar testing is another example of the benefits we get from UB: by giving one particular meaning to any given CLIF, we can test via equality of outputs, rather than some sort of refinement or one-possible-valid-behavior predicate
every once in a while this comes up when someone sees an optimization opportunity; maybe I should write an RFC if it would help align everyone :-)
Thank you. Would be interesting to see how far we can push perf without compromising on UB
IIRC there was a paper where they disabled UB in LLVM and there was only a few % slowdown in most benchmarks
I do suspect that there are ways to design abstractions that expose some of the optimization opportunities without UB, though. One way to see UB is under-capture of invariants in the IR's type system. So for example a "non-zero integer type" as the output of a "trap if divisor is 0" operator could then be fed into a non-trapping div
ah, that's interesting, I haven't seen concrete numbers; that's not bad at all
Chris Fallin said:
I do suspect that there are ways to design abstractions that expose some of the optimization opportunities without UB, though. One way to see UB is under-capture of invariants in the IR's type system. So for example a "non-zero integer type" as the output of a "trap if divisor is 0" operator could then be fed into a non-trapping div
Something like a refinement type system?
Yeah, in the most general sense :-) I don't know if I would want something like fully general refinement predicates in CLIF
though maybe actually at least integer-range refinements would make sense
kmeakin has marked this topic as resolved.
kmeakin said:
IIRC there was a paper where they disabled UB in LLVM and there was only a few % slowdown in most benchmarks
Do you have a link, perchance? I'd be very interested to see how they did that.
Last updated: Jan 24 2025 at 00:11 UTC