avx2 topic
fast-bernoulli
Fast generation of long sequencies of bernoulli-distributed random variables
tensorflow-optimized-wheels
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
fast-hex
Fast, SIMD hex string encoder and decoder C++ lib and Node.js module
base64simd
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
libpopcnt
🚀 Fast C/C++ bit population count library
highwayhash
Node.js implementation of HighwayHash, Google's fast and strong hash function
simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
highway
Performance-portable, length-agnostic SIMD with runtime dispatch