fitzgen opened PR #6608 from fitzgen:tail-no-callee-saves
to bytecodealliance:main
:
That is, mark all allocatable registers as clobbered by
tail
calls.Additionally, use (basically) all allocatable registers for passing arguments and returns. ("Basically" because we have to leave at least one register available to hold the callee address for indirect calls or else we'll get regalloc constraint errors.)
I could get three more argument registers on riscv if I refactored
compute_arg_locs
and/or did something similar to aarch64'scompute_arg_locs_tail
but it didn't seem worth the hassle.
fitzgen requested abrown for a review on PR #6608.
fitzgen requested wasmtime-compiler-reviewers for a review on PR #6608.
fitzgen requested wasmtime-default-reviewers for a review on PR #6608.
fitzgen requested jameysharp for a review on PR #6608.
fitzgen submitted PR review.
fitzgen submitted PR review.
fitzgen created PR review comment:
Note: this is just
clif-util
enabling thetrace-log
feature by default, not changingcranelift-codegen
or any other crate that is actually used as a dep anywhere.
jameysharp submitted PR review:
I don't much like the structure of this code; I know you've said you don't either. But this PR doesn't make it worse, so let's go with it.
I spent some time trying to figure out if there was more we could share between the register allocator configuration and these ABI bits, given that all allocatable registers are supposed to be clobbered in this ABI, but that didn't look straightforward.
As future work, I think there's something we can do with RA2's
PRegSet
to represent all of these different ABI-relevant sets of registers compactly; there's a popcount-like trick for finding the index of the Nth set bit that I think covers most of the uses we need here. (See the "Select the bit position (from the most-significant bit) with the given count (rank)" section of https://graphics.stanford.edu/~seander/bithacks.html.) But that's not necessary to land this.
jameysharp submitted PR review:
I don't much like the structure of this code; I know you've said you don't either. But this PR doesn't make it worse, so let's go with it.
I spent some time trying to figure out if there was more we could share between the register allocator configuration and these ABI bits, given that all allocatable registers are supposed to be clobbered in this ABI, but that didn't look straightforward.
As future work, I think there's something we can do with RA2's
PRegSet
to represent all of these different ABI-relevant sets of registers compactly; there's a popcount-like trick for finding the index of the Nth set bit that I think covers most of the uses we need here. (See the "Select the bit position (from the most-significant bit) with the given count (rank)" section of https://graphics.stanford.edu/~seander/bithacks.html.) But that's not necessary to land this.
jameysharp created PR review comment:
Would you add a comment here saying so?
fitzgen updated PR #6608.
fitzgen has enabled auto merge for PR #6608.
fitzgen merged PR #6608.
Last updated: Jan 24 2025 at 00:11 UTC