OpenBLAS icon indicating copy to clipboard operation
OpenBLAS copied to clipboard

[WIP]Optimize gemm for small matrix

Open xianyi opened this issue 4 years ago • 2 comments

  • [x] Add basic implementation ( please check aae6af94bbe4f7ad97c417e40fe6a7d4a2798b79 )

  • [ ] Merge sgemm_kernel_direct implementation

  • [ ] Work for DYNAMIC_ARCH

  • [ ] Tune the input matrix size.

  • [ ] Add optimized kernel for architecture.

xianyi avatar Apr 28 '20 11:04 xianyi

Probably, you're aware of this project, but I'm leaving it here just for reference: https://github.com/hfp/libxsmm

antoscha avatar May 03 '20 15:05 antoscha

See https://github.com/xianyi/OpenBLAS/issues/3783

ValZapod avatar Oct 06 '22 02:10 ValZapod