iximeow opened Issue #1379:
It looks like our SIMD encodings waver between permitting and not permitting the REX prefix: compare uses of
enc_32_64andenc_bothin x86/encoding.rs.As a result, vector operations using
r8-r15as parts of a memory operand, orxmm8-xmm15as a register operand, get rejected when searching for encodings and, I believe, result in Cranelift not using those xmm registers entirely in some circumstances (we're missing a rex-friendlyMOVUPSencoding so I don't think we can encode a spill ofxmm8-xmm15).I ran across this trying to emit
movups xmm7, [r11]when restoring SIMD registers in Windows prologues. This currently results in a constraint violation panic when compiling. I believe this is because I specified the location after inserting the instruction, so Cranelift picked theFPR8/GPR8-constrained encoding, and I just gave it an impossible operand, whereas regalloc would have been unable to user11as an operand and forced shuffling until a fitting register were available.I'm pretty sure we can make the blanket change from
enc_32_64toenc_bothfor the region I linked above. However, I don't know themaybe_isapvariants, and I figure it'd be better to confirm this before changing stuff around.cc @abrown and @bnjbvr as people who've recently touched these things, I think?
abrown commented on Issue #1379:
For some history, I opened #1090 to track this and then tried something in #1149 that I later abandoned. I think you're right about
enc_32_64 -> enc_bothbut I have this nagging feeling that some of the instructions might use a REX bit to distinguish between different instructions (e.g. an 64x2 vs a 32x4)--but I can't recall which have this behavior and which not. Then again, I could be wrong about that but it bears more investigation. I can take a look next week sometime unless you want to do it now?
iximeow commented on Issue #1379:
Oh! I tried looking for open issues and missed that one. I'd have bumped that instead if I saw it.
You're right - I don't think any lane choices are made by REX prefix bits but at least
cvtsi2sddoes change operands:f2 0f 2a c0: cvtsi2sd xmm0, eax f2 4f 0f 2a c0: cvtsi2sd xmm0, r8I entirely support being careful here :) I just wanted to note the issue lest I forget to mention, if you can take a look next week that's way ahead of when I can.
alexcrichton transferred Issue #1379:
It looks like our SIMD encodings waver between permitting and not permitting the REX prefix: compare uses of
enc_32_64andenc_bothin x86/encoding.rs.As a result, vector operations using
r8-r15as parts of a memory operand, orxmm8-xmm15as a register operand, get rejected when searching for encodings and, I believe, result in Cranelift not using those xmm registers entirely in some circumstances (we're missing a rex-friendlyMOVUPSencoding so I don't think we can encode a spill ofxmm8-xmm15).I ran across this trying to emit
movups xmm7, [r11]when restoring SIMD registers in Windows prologues. This currently results in a constraint violation panic when compiling. I believe this is because I specified the location after inserting the instruction, so Cranelift picked theFPR8/GPR8-constrained encoding, and I just gave it an impossible operand, whereas regalloc would have been unable to user11as an operand and forced shuffling until a fitting register were available.I'm pretty sure we can make the blanket change from
enc_32_64toenc_bothfor the region I linked above. However, I don't know themaybe_isapvariants, and I figure it'd be better to confirm this before changing stuff around.cc @abrown and @bnjbvr as people who've recently touched these things, I think?
Last updated: Dec 06 2025 at 06:05 UTC