Hi all -- I've put together a patch stack to start the review process for the new backend and the ARM64 machine-specific use of it. See https://github.com/cfallin/wasmtime/tree/arm64-merge for the 11 commits. This is a rollup of the ~4 months of work on the arm64
side-branch (excluding the incomplete x64 backend). It's ~17KLoC of code.
Questions:
Who wants to review? I recall @Dan Gohman volunteering a few weeks ago; @Benjamin Bouvier and @Julian Seward and I also are happy to review each others' code (i.e., code we didn't write), if others are fine with that.
Would it be better to do this in a single PR, keeping the 11-commit stack just to ease review partitioning, or submit a dependent chain of 11 PRs?
My thought is that one PR makes sense. This will save you a lot of work over juggling a long chain of dependent PRs. It'll be a big review, but we can focus on making sure we understand the main shape of things rather than probing into every detail, and then, in return for us having saved you all that juggling work, you can be responsive to people probing into the details and asking questions after it lands :-)
How does that sound?
That sounds eminently reasonable!
https://github.com/bytecodealliance/wasmtime/pull/1494 -- review away!
@Chris Fallin , excellent work. I am of course happy to review stuff.
Somewhat related question: what is now the situation regarding further commits to your branch? I ask because as I work to speed up the register allocator, I am beginning to see inefficiencies on the CL side (in the new code) that also need to be fixed. I could make patches for those and put them off to one side until after reviewing/merging is done, if that would be helpful.
@Julian Seward great! I think that Dan is going to review at a high level (as he indicated in the #cranelift channel) but please feel free to go over anything in more detail
Regarding the branch -- I don't want to block ongoing work, so I think we can keep pushing to arm64
until the PR lands; and then we can rebase whatever has happened in the meantime and review as additional PRs
(The alternative is to stack up a bunch of dependent PRs which is especially messy as reviews change things -- don't really want to go there)
Yay \o/
Two things that come to mind:
Co-authored-by: Julian Seward <jseward@acm.org> Co-authored-by: Joey Gouly <joey.gouly@arm.com> Co-authored-by: Benjamin Bouvier <public@benj.me>
Then Github is able to pattern-match this and mark commits as being co-authored, which is nice to better attribute credit.
@Chris Fallin I'm just realizing that this design of parameterizing on the instruction means that it looks like the entire mach backend gets monomorphized for every target, is that right?
Yes, that's right; we made this decision early on to avoid dynamic dispatch
That does work out well for the JIT case where just one target is active, of course, so it's not necessarily a bad idea.
We figured that in the common case, the compiler would be built with support for one target, and in the uncommon case, the code that gets monomorphized is ideally not a large portion of the total code (most of the bulk is in the machine backend and in the target-indep passes, it seems)
Yeah, and in any configuration linking in multiple targets (rustc?), there's going to be a lot of target-specific code anyway.
This may make things harder if we pursue the crazy idea I mentioned earlier of replacing clif IR with vcode IR entirely, because more code would be monomorphized then, but it may still be manageable.
the VCode methods at least would be monomorphized twice, but e.g. the lowering wouldn't be (I imagine we'd conceptually have fn lower<FromInst: VCodeInst, ToInst: VCodeInst>(...)
and just one (from, to) pair in the common case)
so yeah, maybe not too bad.
:tada: :tada: :tada:
Last updated: Jan 24 2025 at 00:11 UTC