blake2_simd
Look into using simdeez
https://github.com/jackmott/simdeez
The goal would be to keep AVX2 performance the same while getting an SSE4.1 fallback implementation for free. The alternative would be to port this implementation directly: https://github.com/BLAKE2/BLAKE2/blob/master/sse/blake2b-round.h#L139
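For context, here is a rough sketch of the runtime dispatch that either approach would need to plug into, assuming the fallback order is AVX2, then SSE4.1, then portable code. The names `Backend` and `detect_backend` are hypothetical and are not taken from this crate or from simdeez; only `is_x86_feature_detected!` is a real standard-library macro. simdeez has its own mechanism for generating per-instruction-set variants, which is not shown here.

```rust
/// Hypothetical backend selector; these names are illustrative only.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Backend {
    Avx2,
    Sse41,
    Portable,
}

/// Pick the widest instruction set the running CPU supports.
/// On non-x86 targets the cfg block is compiled out and this
/// always returns `Backend::Portable`.
pub fn detect_backend() -> Backend {
    #[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
    {
        // Assumption: AVX2 is preferred over SSE4.1 when both are present.
        if is_x86_feature_detected!("avx2") {
            return Backend::Avx2;
        }
        if is_x86_feature_detected!("sse4.1") {
            return Backend::Sse41;
        }
    }
    Backend::Portable
}

fn main() {
    // Report which (hypothetical) implementation would be selected here.
    println!("selected backend: {:?}", detect_backend());
}
```

With simdeez, the AVX2 and SSE4.1 arms would ideally share one generic compression function; with a direct port, the SSE4.1 arm would call a separately written implementation.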