simde
simde copied to clipboard
Implementations of SIMD instruction sets for systems which don't natively support them.
Target: 2022-05-22
Here are lines 186-188 of simde/simde-features.h: ``` #if defined(SIMDE_X86_AVX_NATIVE) && !defined(SIMDE_X86_SSE4_1_NATIVE) #define SIMDE_X86_SSE4_2_NATIVE #endif ``` Here is how lines 186-188 of simde/simde-features.h should read: ``` #if defined(SIMDE_X86_AVX_NATIVE) && !defined(SIMDE_X86_SSE4_2_NATIVE) #define...
Including "simde/arm/neon.h" fails to compile on Visual C++ for an ARM64 target since some of the SIMD_CONSTIFY_2_, SIMDE_CONSTIFY_4_, SIMDE_CONSTIFY_8_, and SIMDE_CONSTIFY_16_ macros used in some of the header files in...
The simde_vaddvq_u8, simde_vaddlvq_u8, simde_vaddvq_s8, and simde_vaddlvq_s8 routines can be implemented on x86 platforms with SSE2 support using the _mm_sad_epu8 intrinsic. Here is how simde_vaddvq_u8 could be implemented on x86 platforms...
While the fix in simde/x86/avx512/permutex2var.h that was applied to commit https://github.com/simd-everywhere/simde/commit/88931164bdb8b628702079bfa177087a614cc975 does partially fix the issue with getting simde/x86/avx512/permutex2var.h to compile with Visual C++, I am still getting the following...
I find `_rdtsc`, `__rdtsc`, and/or `_rdtscp` in a lot of code that uses x86 intrinsics. I requested this feature from SSE2Neon (https://github.com/DLTcollab/sse2neon/issues/472) and some folks suggested an implementation for ARM....
Fixes #945
Hello, According to msvc's [documentation](https://docs.microsoft.com/en-us/cpp/build/reference/arch-x64?view=msvc-170), no `__FMA__` flag is defined when compiling with msvc, as its availbility is already implied in `__AVX2__`. Current detection macro only checks `__FMA__` thus native...
This pull request introduces support of Elbrus hardware platform (which is based on Russian [Elbrus](https://en.wikipedia.org/wiki/Elbrus-8S) CPU family) with its native lcc (eLbrus Compiler Collection) compiler. This is linked to the...
This appears to be an issue with GCC 8 and 9. I have built with Clang 10 on the same machine without issue, and with GCC 7, 10 and 11....