andrewmd5 edited issue #8247:
Thanks for filing a feature request! Please fill out the TODOs below.
Feature
Compile the C API and produce ARM64 builds for Windows
Benefit
Will unblock https://github.com/bytecodealliance/wasmtime-dotnet/issues/298 in the wasmtime-dotnet package
Not sure if the wheel exist for this yet; in terms of infrastructure cross compilation on Github Actions should be possible, but if necessary I’m happy to deploy some new self-hosted runners to power this feature.
cfallin commented on issue #8247:
This depends on #4992 -- a little bit of core runtime functionality (trap handling, unwind info generation) necessary for this OS/architecture pair. If you're willing to work on this, we'd be happy to review a PR!
peterhuene edited issue #8247:
Feature
Compile the C API and produce ARM64 builds for Windows
Benefit
Will unblock this issue in the wasmtime-dotnet package
Not sure if the wheel exist for this yet; in terms of infrastructure cross compilation on Github Actions should be possible, but if necessary I’m happy to deploy some new self-hosted runners to power this feature.
andrewmd5 commented on issue #8247:
This depends on #4992 -- a little bit of core runtime functionality (trap handling, unwind info generation) necessary for this OS/architecture pair. If you're willing to work on this, we'd be happy to review a PR!
Thank you for linking the relevant issue; I’ll take a look and see about submitting a PR.
alexcrichton added the wasmtime:platform-support label to Issue #8247.
dpaoliello commented on issue #8247:
I've been trying to get
rustc_codegen_cranelift
working with Windows ARM64, but I've run into an issue with alignment.My work-in-progress branch is available at <https://github.com/dpaoliello/wasmtime/tree/arm64wip>
Current, when I run
y build
, a bunch of linker errors from link.exe complaining about the alignment of symbols:std-0891cada1b439ffb.dq2m4pkr5rfjx0xrog6ne1sfh.rcgu.o : error LNK2048: relocation PAGEOFFSET_12L targeting 'memcpy' (0019EE84) is invalid for the instruction (F9400084 at RVA 000D6C84) at section 0x1 offset 0x137C, due to bad alignment of offset to target (E84); expected to be 8 bytes aligned
Import thing from that output is that
memcpy
is being located at0019EE84
, which is not 8 byte aligned.I've tried setting the function alignment to 8: <https://github.com/dpaoliello/wasmtime/blob/e7184160fe909864c767177de17729f876c0da60/cranelift/codegen/src/isa/aarch64/inst/mod.rs#L1184>
And the symbol alignment to 8:
<https://github.com/dpaoliello/wasmtime/blob/e7184160fe909864c767177de17729f876c0da60/cranelift/codegen/src/isa/mod.rs#L430>But neither seems to have fixed this - any idea what I'm doing wrong?
alexcrichton commented on issue #8247:
I think
memcpy
would be defined in the libc-equivalent-windows-has, which might be why changing Cranelift's function/symbol alignment didn't work? How sure are you thememcpy
function itself is created by Cranelift?If it's not created by Cranelift this might be something where we're generating the wrong relocation against
memcpy
perhaps? Where the one we're generating requires 8-byte alignment but we should be using something else that doesn't require 8-byte alignment?
dpaoliello commented on issue #8247:
It's not
memcpy
specifically, I'm also seeing a bunch of Win32 function, so it's likely any external symbol that the obj is referencing. I'm not familiar with how external symbols are represented in obj files, or how cranelift places them there, so I'll have to dig into this further when I have time.
dpaoliello commented on issue #8247:
Ok, I finally understand what's happening here.
When emitting a call, wasmtime emits it as
LoadExtName
then the call indirect:
https://github.com/bytecodealliance/wasmtime/blob/e56ffd77f1fb2240e163b7f840f8c4e728c98434/cranelift/codegen/src/isa/aarch64/abi.rs#L1041-L1045
LoadExtName
always get lowered and the page base + page offset reloc:
https://github.com/bytecodealliance/wasmtime/blob/e56ffd77f1fb2240e163b7f840f8c4e728c98434/cranelift/codegen/src/isa/aarch64/inst/emit.rs#L3165-L3185That type of reloc needs to be 8 byte aligned, however functions (at least on Windows, not sure about other platforms) aren't guaranteed to be 8 byte aligned, thus the linker complains.
When I look at code generated by LLVM, I see branch26 relocs being emitted for called functions, which seems to be generated by this:
https://github.com/llvm/llvm-project/blob/18ee00323f5fc22d32a74b636fcac84e697241f3/llvm/lib/Target/AArch64/MCTargetDesc/AArch64MCCodeEmitter.cpp#L459-L478I'm not entirely sure how to handle implementing this in wasmtime with the abstraction between abi.rs and emit.rs.
dpaoliello commented on issue #8247:
And, just to confound things, sometimes LLVM will generate a page base + page offset for a call: <https://godbolt.org/z/3c7ce1vP4>
alexcrichton commented on issue #8247:
Is there a relocation that doesn't need to be 8-byte aligned we could use? I'm a bit surprised by that godbolt link because
example::call_yep::h3f70d43527bee11a: 0: str x30, [sp, #-16]! 4: adrp x8, __imp_yep 8: ldr x8, [x8, :lo12:__imp_yep] c: blr x8 10: adrp x8, __imp_X 14: ldr x8, [x8, :lo12:__imp_X] 18: ldr w8, [x8] 1c: madd w0, w8, w8, w0 20: ldr x30, [sp], #16 24: ret
Here
adrp
falls on both an 8-byte and non-8-byte aligned boundary (0x4/0x10). How does that work if it's required to be 8-byte aligned? Or is the assembler hiding anop
instruction or something like that?If we need these relocations to be 8-byte aligned in Cranelift it would probably look like:
- Guarantee all functions are 8-byte-aligned (I think some are 4-byte-aligned right now)
- When emitting this relocation if the current offset is 4-byte aligned then emit a nop to make it 8-byte aligned.
I'm mostly surprised that LLVM doesn't seem to be doing anything with nops but from what you're saying it should work?
dpaoliello commented on issue #8247:
Sorry, bit of confusion, the target of the reloc needs to be 8 byte aligned, not the reloc or consuming instruction.
alexcrichton commented on issue #8247:
Aha that makes more sense! (I should also read more carefully...)
This might be as simple as updating this value? That could perhaps have a comment for now saying only Windows so far requires 8-byte alignment but it's easier to bump all platforms to 8-byte so that's why it's unconditionally a minimum of 8 for now.
dpaoliello commented on issue #8247:
That helps: it at least means that any function in the current compilation can be the target of a reloc.
But I'm also seeing the linker complain about Win32 functions and parts of the CRT.
alexcrichton commented on issue #8247:
Oh dear sorry I'm being particularly slow at understanding this, you've already told me that historically as well...
Is this perhaps something related to dllimport or something like that? Where memcpy should be imported via dllimport and some slightly different form of relocation is something the linker handles when fixing it up? Otherwise I'll probably step aside as I'm probably out of my depth here...
Last updated: Jan 24 2025 at 00:11 UTC