x86-simd-sort
x86-simd-sort copied to clipboard
Improve argsort for 32-bit
32-bit argsort uses ymm registers: we can switch to zmm registers (use 2x i64gather instructions) and add new bitonic networks.