OpenBLAS [WIP]Optimize gemm for small matrix

[WIP]Optimize gemm for small matrix

Open xianyi opened this issue 4 years ago • 2 comments

[x] Add basic implementation ( please check aae6af94bbe4f7ad97c417e40fe6a7d4a2798b79 )
[ ] Merge sgemm_kernel_direct implementation
[ ] Work for DYNAMIC_ARCH
[ ] Tune the input matrix size.
[ ] Add optimized kernel for architecture.

Apr 28 '20 11:04 xianyi

Probably, you're aware of this project, but I'm leaving it here just for reference: https://github.com/hfp/libxsmm

May 03 '20 15:05 antoscha

See https://github.com/xianyi/OpenBLAS/issues/3783

Oct 06 '22 02:10 ValZapod