abrown opened PR #2930 from shuffle-with-vperm
to main
:
When shuffling values from two different registers, the x64 lowering for
i8x16.shuffle
must first shuffle each register separately and then OR
the results with SSE instructions. WithVPERMI2B
, available in
AVX512VL and AVX512VBMI, this can be done in a single instruction after
the shuffle mask has been moved into the destination register. This
change usesVPERMI2B
for that case when the CPU supports it.<!--
Please ensure that the following steps are all taken care of before submitting
the PR.
[ ] This has been discussed in issue #..., or if not, please tell us why
here.[ ] A short description of what this does, why it is needed; if the
description becomes long, the matter should probably be discussed in an issue
first.[ ] This PR contains test cases, if meaningful.
- [ ] A reviewer from the core maintainer team has been assigned for this PR.
If you don't know who could review this, please indicate so. The list of
suggested reviewers on the right can help you.Please ensure all communication adheres to the code of conduct.
-->
abrown updated PR #2930 from shuffle-with-vperm
to main
.
abrown updated PR #2930 from shuffle-with-vperm
to main
.
abrown updated PR #2930 from shuffle-with-vperm
to main
.
abrown updated PR #2930 from shuffle-with-vperm
to main
.
abrown updated PR #2930 from shuffle-with-vperm
to main
.
abrown updated PR #2930 from shuffle-with-vperm
to main
.
abrown updated PR #2930 from shuffle-with-vperm
to main
.
abrown requested cfallin for a review on PR #2930.
abrown has marked PR #2930 as ready for review.
cfallin submitted PR review.
abrown merged PR #2930.
Last updated: Nov 22 2024 at 16:03 UTC