abrown opened PR #1409 from additional-i8x16-shift
to master
:
This is a follow-on to #1377; review once that PR is merged. This PR adds support for i8x16 left shift and arithmetic right shift. Both are long sequences of instructions on x86.
abrown updated PR #1409 from additional-i8x16-shift
to master
:
This is a follow-on to #1377; review once that PR is merged. This PR adds support for i8x16 left shift and arithmetic right shift. Both are long sequences of instructions on x86.
abrown updated PR #1409 from additional-i8x16-shift
to master
:
This is a follow-on to #1377; review once that PR is merged. This PR adds support for i8x16 left shift and arithmetic right shift. Both are long sequences of instructions on x86.
abrown updated PR #1409 from additional-i8x16-shift
to master
:
This is a follow-on to #1377; review once that PR is merged. This PR adds support for i8x16 left shift and arithmetic right shift. Both are long sequences of instructions on x86.
abrown requested julian-seward1 for a review on PR #1409.
julian-seward1 submitted PR Review.
julian-seward1 created PR Review Comment:
Read naively, this would seem to insert the mask table once for every converted ishl. That would be ungood. Does
pos.func.dfg.constants.insert
de-dup, and hence keep just one copy?
julian-seward1 submitted PR Review.
julian-seward1 created PR Review Comment:
What does
this
refer to here? On the whole I'd prefer to stick with the Intel style convention, where the rightmost lane is lane zero. But note; my comment is very "drive-by"; I don't know what the lane numbering conventions in CL are.
julian-seward1 submitted PR Review.
julian-seward1 created PR Review Comment:
Did you mean "shift an extra 8 bits" ?
julian-seward1 submitted PR Review.
julian-seward1 submitted PR Review.
julian-seward1 submitted PR Review.
julian-seward1 submitted PR Review.
julian-seward1 submitted PR Review.
julian-seward1 submitted PR Review.
abrown submitted PR Review.
abrown created PR Review Comment:
ConstantPool
does de-duplicate values but currently it is limited to a per-function boundary because of how things are compiled. I opened #1385 to talk about this some more: I think with a bit of thought we could de-duplicate across modules at least.
abrown submitted PR Review.
abrown created PR Review Comment:
Ok, we can switch to Intel-style. I went back and forth on this one and I can't recall why I chose this ordering. I'll look around the file to see if there was something that made me choose this.
abrown submitted PR Review.
abrown created PR Review Comment:
Yup, good catch.
abrown updated PR #1409 from additional-i8x16-shift
to master
:
This is a follow-on to #1377; review once that PR is merged. This PR adds support for i8x16 left shift and arithmetic right shift. Both are long sequences of instructions on x86.
abrown updated PR #1409 from additional-i8x16-shift
to master
:
This is a follow-on to #1377; review once that PR is merged. This PR adds support for i8x16 left shift and arithmetic right shift. Both are long sequences of instructions on x86.
abrown updated PR #1409 from additional-i8x16-shift
to master
:
This is a follow-on to #1377; review once that PR is merged. This PR adds support for i8x16 left shift and arithmetic right shift. Both are long sequences of instructions on x86.
abrown merged PR #1409.
Last updated: Jan 24 2025 at 00:11 UTC