cfallin opened PR #4182 from regalloc2-no-scratch-reg
to main
:
RA2 recently removed the need for a dedicated scratch register for
cyclic moves (bytecodealliance/regalloc2#51). This has moderate positive
performance impact on function bodies that were register-constrained, as
it means that one more register is available. In Sightglass, I measured
+5-8% onblake3-scalar
, at least among current benchmarks.<!--
Please ensure that the following steps are all taken care of before submitting
the PR.
[ ] This has been discussed in issue #..., or if not, please tell us why
here.[ ] A short description of what this does, why it is needed; if the
description becomes long, the matter should probably be discussed in an issue
first.[ ] This PR contains test cases, if meaningful.
- [ ] A reviewer from the core maintainer team has been assigned for this PR.
If you don't know who could review this, please indicate so. The list of
suggested reviewers on the right can help you.Please ensure all communication adheres to the code of conduct.
-->
cfallin requested fitzgen for a review on PR #4182.
fitzgen submitted PR review.
cfallin updated PR #4182 from regalloc2-no-scratch-reg
to main
.
cfallin updated PR #4182 from regalloc2-no-scratch-reg
to main
.
cfallin merged PR #4182.
Last updated: Nov 22 2024 at 16:03 UTC