ArtBlnd opened PR #5036 from `x86_64-float-bitops1` to `main`:
<!--
Please ensure that the following steps are all taken care of before submitting
the PR.

- [ ] This has been discussed in issue #..., or if not, please tell us why
  here.
- [ ] A short description of what this does, why it is needed; if the
  description becomes long, the matter should probably be discussed in an issue
  first.
- [ ] This PR contains test cases, if meaningful.
- [ ] A reviewer from the core maintainer team has been assigned for this PR.
  If you don't know who could review this, please indicate so. The list of
  suggested reviewers on the right can help you.

Please ensure all communication adheres to the code of conduct.
-->

This patch implements float bitops on x86_64 using SSE instructions.
@afonso360
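For readers unfamiliar with the term, a float bitop applies a bitwise operation to the raw IEEE 754 bit pattern of a floating-point value. The following is a minimal, portable sketch of those semantics for `f32`, not the code in this patch. The function names `band_f32`, `bor_f32`, `bxor_f32`, and `bnot_f32` are hypothetical and chosen only for illustration.

```rust
// Illustrative model of float bitops: each function reinterprets the
// float as its raw bit pattern, applies the integer operation, and
// reinterprets the result as a float again. Names are hypothetical.

/// Bitwise AND of the raw bit patterns of two f32 values.
pub fn band_f32(a: f32, b: f32) -> f32 {
    f32::from_bits(a.to_bits() & b.to_bits())
}

/// Bitwise OR of the raw bit patterns of two f32 values.
pub fn bor_f32(a: f32, b: f32) -> f32 {
    f32::from_bits(a.to_bits() | b.to_bits())
}

/// Bitwise XOR of the raw bit patterns of two f32 values.
pub fn bxor_f32(a: f32, b: f32) -> f32 {
    f32::from_bits(a.to_bits() ^ b.to_bits())
}

/// Bitwise NOT of the raw bit pattern of an f32 value.
pub fn bnot_f32(a: f32) -> f32 {
    f32::from_bits(!a.to_bits())
}
```

With these definitions, XOR-ing `2.0` with `-0.0` (only the sign bit set) flips the sign and yields `-2.0`, and AND-ing `-1.5` with the pattern `0x7FFF_FFFF` clears the sign bit and yields `1.5`.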
ArtBlnd edited PR #5036 from `x86_64-float-bitops1` to `main`:
This patch implements float bitops on x86_64 using SSE instructions.
@afonso360
- [ ] Check whether better single-slot bitops are available on x86_64. (Which has better latency or throughput?...)
- [ ] Make a single-slot mask for `f32`, `f64` instead of `vector_all_ones`
ArtBlnd edited PR #5036 from `x86_64-float-bitops1` to `main`:
This patch implements float bitops on x86_64 using SSE instructions.
@afonso360
- [x] Check whether better single-slot bitops are available on x86_64. (Which has better latency or throughput?...)
- [ ] Make a single-slot mask for `f32`, `f64` instead of `vector_all_ones`
ArtBlnd edited PR #5036 from `x86_64-float-bitops1` to `main`:
This patch implements float bitops on x86_64 using SSE instructions.
@afonso360
- [ ] Check whether better single-slot bitops are available on x86_64. (Which has better latency or throughput?...)
- [ ] Make a single-slot mask for `f32`, `f64` instead of `vector_all_ones`
ArtBlnd updated PR #5036 from `x86_64-float-bitops1` to `main`.
ArtBlnd edited PR #5036 from `x86_64-float-bitops1` to `main`:
This patch implements float bitops on x86_64 using SSE instructions.
@afonso360
- [x] Check whether better single-slot bitops are available on x86_64. (Which has better latency or throughput?...)
- [ ] Make a single-slot mask for `f32`, `f64` instead of `vector_all_ones`
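The second TODO item can be illustrated with a small model. This is only a sketch under assumptions: it treats a 128-bit SSE register as a `u128` with the scalar in the low bits, and it assumes the mask is used as the XOR operand for a bitwise NOT. The PR description does not say how `vector_all_ones` is used, and the names `F32_SLOT_MASK`, `F64_SLOT_MASK`, `xmm_from_f32`, `low_f32`, and `bnot_with_mask` are hypothetical.

```rust
// Model of a 128-bit XMM register as a u128. A scalar f32 occupies the
// low 32 bits; a scalar f64 would occupy the low 64 bits.
// All names below are hypothetical and exist only for this sketch.

/// All 128 bits set (assumed meaning of a vector all-ones mask).
pub const VECTOR_ALL_ONES: u128 = u128::MAX;
/// Only the low 32 bits set: a mask covering a single f32 slot.
pub const F32_SLOT_MASK: u128 = 0xFFFF_FFFF;
/// Only the low 64 bits set: a mask covering a single f64 slot.
pub const F64_SLOT_MASK: u128 = 0xFFFF_FFFF_FFFF_FFFF;

/// Places an f32 in the low 32 bits of a zeroed register.
pub fn xmm_from_f32(x: f32) -> u128 {
    x.to_bits() as u128
}

/// Reads the scalar f32 from the low 32 bits of the register.
pub fn low_f32(reg: u128) -> f32 {
    f32::from_bits(reg as u32)
}

/// Bitwise NOT expressed as XOR with a mask, as a full-width XOR would do.
pub fn bnot_with_mask(reg: u128, mask: u128) -> u128 {
    reg ^ mask
}
```

In this model both masks produce the same scalar result in the low slot. The difference is that the full-width mask also flips the unused upper bits of the register, while the single-slot mask leaves them untouched.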
Last updated: Nov 22 2024 at 16:03 UTC