simde icon indicating copy to clipboard operation
simde copied to clipboard

SSE4.2 functions

Open nemequ opened this issue 7 years ago • 8 comments

It would be good to support the SSE4.2 instruction set, which are mostly string functions.

This might be a good first issue; you don't need to implement all of the functions at once, and you if your machine supports SSE4.2 natively it's pretty easy to test.

You can find details about each one at https://software.intel.com/sites/landingpage/IntrinsicsGuide/#techs=SSE4_2

For progress, see https://github.com/simd-everywhere/implementation-status/blob/main/x86.md#sse42

nemequ avatar Mar 31 '17 02:03 nemequ

@nemequ Could you set up the build / test infrastructure for this? There is some interest @ FU Berlin in using SIMDe w/ SSE4.2

mr-c avatar Feb 27 '20 13:02 mr-c

It's all ready to go, though it was a bit difficult to understand how to use it without looking at another ISA extension. I just added _mm_cmpgt_epi64 (11b0c2e7e08afbddad99ebb26ed66b52568a8ab7) as an example, hopefully that helps clear things up.

nemequ avatar Feb 27 '20 20:02 nemequ

I might consider this issue to work on as part of GSoC, is anybody working on it already?

FaresSalem avatar Mar 24 '20 21:03 FaresSalem

Yes, @masterchef2209 is planning to work on this. Sorry.

nemequ avatar Mar 25 '20 07:03 nemequ

Any progress on this?

pthariensflame avatar May 04 '22 20:05 pthariensflame

I'm missing intrinsic _mm_cmpestri.

sebpop avatar Jun 03 '22 12:06 sebpop

I'm missing intrinsic _mm_cmpestri.

Recently, SSE2NEON implements both _mm_cmpestri and _mm_cmpestrm.

jserv avatar Oct 08 '22 04:10 jserv

I'm missing intrinsic _mm_cmpestri.

Recently, SSE2NEON implements both _mm_cmpestri and _mm_cmpestrm.

Yes, they have all of the SSE4.2 functions we are missing; porting those to SIMDe style would be greatly appreciated!

https://github.com/DLTcollab/sse2neon/blob/39d854002840f12baa85cfdf01b120eabdfa76ed/sse2neon.h#L8093-L8097

mr-c avatar May 19 '23 11:05 mr-c