OpenBLAS icon indicating copy to clipboard operation
OpenBLAS copied to clipboard

Add ASIMD Small GEMM kernels

Open Mousius opened this issue 1 year ago • 1 comments

These are experiments to see whether or not we can improve performance a bit on 128-bit SVE cores by using ASIMD instead.

Mousius avatar Nov 04 '24 11:11 Mousius

These are probably helpful for #2712 even if they did not appear to result in any speedup for the mystery Graviton4 workload

martin-frbg avatar Jan 03 '25 21:01 martin-frbg