gemm
gemm copied to clipboard
Low Level API with Pre Allocated Work Space Exposed
It would be great if there will be, for any function, a low level API which exposes the needed workspace to avoid any allocations of the function.
The work is impressive. Being competitive with the big guys is nothing short of amazing considering this is a single person show.