bjorn3 added the bug label to Issue #7999.
bjorn3 added the cranelift label to Issue #7999.
bjorn3 opened issue #7999:
.clif Test Case
    set opt_level=speed
    target x86_64

    function u0:11(i8) -> i8 system_v {
    block0(v0: i8):
        v1 = uextend.i64 v0
        v2 = imul_imm v1, 256
        v3 = ireduce.i8 v2
        return v3
    }
Steps to Reproduce
- Compile reproducing test case with opt_level=speed.
Expected Results
The function unconditionally returns 0.
Actual Results
The function gets optimized to returning the argument:
    function u0:11(i8) -> i8 system_v {
    block0(v0: i8):
        return v0
    }
Versions and Environment
Cranelift version or commit: 0.105 and 36fb62ca3bea7dd1456f12ead03084e2ceb48cda
Operating system: N/A
Architecture: x86_64
Extra Info
Found by @cbeuw in https://github.com/rust-lang/rustc_codegen_cranelift/issues/1460. Replacing i8 with i32 returns the correct result.
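The expected result can be checked exhaustively outside Cranelift. The following is a minimal sketch that models the three CLIF instructions with plain Rust integer operations. The function names `reproducer` and `miscompiled` are hypothetical and are not part of Cranelift; the sketch only assumes that `ireduce.i8` keeps the low 8 bits of its operand.

```rust
/// Models the reproducer: uextend.i64, imul_imm 256, ireduce.i8.
/// (Hypothetical helper, not Cranelift code.)
fn reproducer(v0: u8) -> u8 {
    let v1 = u64::from(v0); // uextend.i64 v0
    let v2 = v1.wrapping_mul(256); // imul_imm v1, 256
    v2 as u8 // ireduce.i8 v2: keep only the low 8 bits
}

/// Models the miscompiled output, which returns the argument unchanged.
fn miscompiled(v0: u8) -> u8 {
    v0
}

fn main() {
    // Multiplying by 256 shifts the value left by 8 bits,
    // so the low 8 bits are zero for every possible input.
    for x in 0..=u8::MAX {
        assert_eq!(reproducer(x), 0);
    }

    // The miscompiled version agrees only for the input 0.
    let mismatches = (0..=u8::MAX)
        .filter(|&x| reproducer(x) != miscompiled(x))
        .count();
    println!("inputs where the outputs differ: {mismatches}");
}
```

Under this model the two versions disagree on every nonzero input, so the miscompilation is observable for almost any argument.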
fitzgen commented on issue #7999:
Thanks for the bug report!
Might be related to https://github.com/bytecodealliance/wasmtime/pull/7882 -- cc @elliott
alexcrichton commented on issue #7999:
Is this perhaps the wrong reduction? With rustc this code also optimizes to return 0:
    #[no_mangle]
    pub extern "C" fn a(a: u8) -> u8 {
        let a = u64::from(a);
        let a = a * 256;
        a as u8
    }
although I fear I may be missing something subtle by accident
elliottt commented on issue #7999:
Doing a little bit of debugging, reverting #7882 doesn't fix the problem, but reverting #7719 as well does. Here's the assembly before reverting #7719:
    0: 55             pushq %rbp
    1: 48 89 e5       movq %rsp, %rbp
    4: 48 89 f8       movq %rdi, %rax
    7: 48 89 ec       movq %rbp, %rsp
    a: 5d             popq %rbp
    b: c3             retq
And here's the output after reverting both #7882 and #7719:
    0: 55             pushq %rbp
    1: 48 89 e5       movq %rsp, %rbp
    4: 48 0f b6 c7    movzbq %dil, %rax
    8: 48 c1 e0 08    shlq $8, %rax
    c: 48 89 ec       movq %rbp, %rsp
    f: 5d             popq %rbp
    10: c3            retq
I think there's an unexpected interaction of rules happening here, because in isolation the rules we have for imul x, 0 and the rules introduced in #7719 for reordering ireduce and extend should be sufficient to rewrite this to moving 0 into %rax.
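One way to read this reasoning: truncating to 8 bits after a wrapping multiply gives the same result as truncating both operands first and multiplying in 8 bits, and the constant 256 truncates to 0. The sketch below checks that identity in plain Rust. It models only the arithmetic, not Cranelift's actual rewrite rules, and the function names are hypothetical.

```rust
/// Multiply in 64 bits, then truncate to 8 bits
/// (the shape of the original instruction sequence).
fn reduce_after_mul(x: u8, k: u64) -> u8 {
    u64::from(x).wrapping_mul(k) as u8
}

/// Truncate both operands first, then multiply in 8 bits
/// (the shape after moving the reduction inward).
fn reduce_before_mul(x: u8, k: u64) -> u8 {
    x.wrapping_mul(k as u8)
}

fn main() {
    // Wrapping multiplication commutes with truncation, so both
    // forms agree for every 8-bit input and each sample constant.
    for k in [0u64, 1, 3, 255, 256, 257, 0x1234] {
        for x in 0..=u8::MAX {
            assert_eq!(reduce_after_mul(x, k), reduce_before_mul(x, k));
        }
    }

    // The constant 256 truncates to 0, so the 8-bit multiply is
    // a multiply by zero and the whole expression folds to 0.
    assert_eq!(256u64 as u8, 0);
    println!("ok");
}
```

With the reduction moved inward, the multiply sees the constant 0, which is why a correct combination of the rules would be expected to produce a constant 0 rather than the argument.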
alexcrichton commented on issue #7999:
> Actual Results
> The function gets optimized to returning the argument:
Aha that's what I was missing, actually reading! Apologies!
jameysharp closed issue #7999.
Last updated: Nov 22 2024 at 16:03 UTC