simde
simde copied to clipboard
Implementations of SIMD instruction sets for systems which don't natively support them.
https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#othertechs=ADX https://en.wikipedia.org/wiki/Intel_ADX - ADCX Adds two unsigned integers plus carry, reading the carry from the carry flag and if necessary setting it there. Does not affect other flags than the...
https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#othertechs=BMI2 https://en.wikipedia.org/wiki/X86_Bit_manipulation_instruction_set#BMI2_(Bit_Manipulation_Instruction_Set_2)
https://en.wikipedia.org/wiki/List_of_discontinued_x86_instructions#FMA4_instructions [GCC header](https://gcc.gnu.org/git/?p=gcc.git;a=blob;f=gcc/config/i386/fma4intrin.h;h=23d36b9f5fa586a536b1b90d57c179e85f72d2cd;hb=HEAD) [llvm header](https://github.com/llvm/llvm-project/blob/4998587e6f5f66d464ac22ad4c11fe9afd2d56ab/clang/lib/Headers/fma4intrin.h)
https://en.wikipedia.org/wiki/SSE4#SSE4a - EXTRQ/INSERTQ Combined mask-shift instructions. - MOVNTSD/MOVNTSS Scalar streaming store instructions. Header is [](https://gcc.gnu.org/git/?p=gcc.git;a=blob;f=gcc/config/i386/ammintrin.h;h=b94731a9e2ca64529189a3ed8b16a5147c27c652;hb=HEAD)
https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#othertechs=VAES https://en.wikipedia.org/wiki/AVX-512#VAES Implementation status: https://github.com/simd-everywhere/implementation-status/blob/main/x86.md#vaes
@eric900115 can you do this?
Recent [Qemu confirms](https://github.com/simd-everywhere/simde/issues/1099#issuecomment-2009639035) the different behavior that was seen on [mac M1](https://github.com/simd-everywhere/simde/issues/1099#issuecomment-1792610406) and [a Pixel 4a 5g phone](https://github.com/simd-everywhere/simde/issues/1099#issuecomment-1799379269) versus our previous implementation. To fix this: 1. Revert https://github.com/simd-everywhere/simde/commit/339ffe4ca89bf6698cab0c8420fe813a4e293aec 2. Adjust...
WebKit project recently imported simde 0.8.2 Since then the build for Aarch64 targeting RPi4 boards has started giving lot of warnings related to the simde header: ` warning: "__ARM_NEON_FP" is...
https://github.com/simd-everywhere/simde/commit/339ffe4ca89bf6698cab0c8420fe813a4e293aec Please test using real hardware ; see https://github.com/simd-everywhere/simde/issues/1099 for more details
_Float16 is only supported on a few architectures. Let's assume it's not supported unless we know otherwise. References: * https://bugs.webkit.org/show_bug.cgi?id=274086 * https://gcc.gnu.org/onlinedocs/gcc/Half-Precision.html * https://clang.llvm.org/docs/LanguageExtensions.html#half-precision-floating-point