abrown opened PR #2914 from fcvt_from_uint
to main
:
When AVX512VL and AVX512F are available, use a single instruction
(VCVTUDQ2PS
) instead of a length 9-instruction sequence. This
optimization is a port from the legacy x86 backend.<!--
Please ensure that the following steps are all taken care of before submitting
the PR.
[ ] This has been discussed in issue #..., or if not, please tell us why
here.[ ] A short description of what this does, why it is needed; if the
description becomes long, the matter should probably be discussed in an issue
first.[ ] This PR contains test cases, if meaningful.
- [ ] A reviewer from the core maintainer team has been assigned for this PR.
If you don't know who could review this, please indicate so. The list of
suggested reviewers on the right can help you.Please ensure all communication adheres to the code of conduct.
-->
abrown requested jlb6740 for a review on PR #2914.
cfallin submitted PR review.
cfallin merged PR #2914.
Last updated: Nov 22 2024 at 16:03 UTC