CTranslate2
CTranslate2 copied to clipboard
feat: add ruy sgemm implementation
This PR adds SGEMM implementation with RUY. This is already mentioned in faster-whisper
repository.
I implemented this, because my experience with BLAS on Android was worse than this, and BLAS performance was not consistent (It sometimes takes too long). In addition, this requires no additional dependency and allows one to use ctranslate2
without BLAS.