matrix-multiply topic
matrix-multiply repositories
cuda_hgemm
270 stars, 62 forks
Several optimization methods for half-precision general matrix multiplication (HGEMM) on Tensor Cores, using the WMMA API and MMA PTX instructions.
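For readers unfamiliar with the WMMA API named above, here is a minimal sketch of a single warp multiplying one 16x16x16 tile on Tensor Cores. It is not taken from the cuda_hgemm repository: the kernel name `wmma_tile_kernel` and the helper `run_wmma_tile_check` are hypothetical, and the sketch assumes a GPU of compute capability 7.0 or newer with managed-memory support (compile with, for example, `nvcc -arch=sm_70`).

```cuda
#include <cuda_fp16.h>
#include <cuda_runtime.h>
#include <mma.h>

using namespace nvcuda;

// One warp computes a single 16x16 tile C = A * B.
// A is row-major, B is column-major, inputs are FP16, accumulation is FP32.
__global__ void wmma_tile_kernel(const half* a, const half* b, float* c) {
    wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
    wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::col_major> b_frag;
    wmma::fragment<wmma::accumulator, 16, 16, 16, float> c_frag;

    wmma::fill_fragment(c_frag, 0.0f);               // start the accumulator at zero
    wmma::load_matrix_sync(a_frag, a, 16);           // leading dimension of A is 16
    wmma::load_matrix_sync(b_frag, b, 16);           // leading dimension of B is 16
    wmma::mma_sync(c_frag, a_frag, b_frag, c_frag);  // c_frag = a_frag * b_frag + c_frag
    wmma::store_matrix_sync(c, c_frag, 16, wmma::mem_row_major);
}

// Hypothetical helper (an assumption, not part of any library): fills A and B
// with ones, runs the kernel, and checks that every element of C equals 16.
bool run_wmma_tile_check() {
    const int n = 16 * 16;
    half* a = nullptr;
    half* b = nullptr;
    float* c = nullptr;
    if (cudaMallocManaged(&a, n * sizeof(half)) != cudaSuccess) return false;
    if (cudaMallocManaged(&b, n * sizeof(half)) != cudaSuccess) return false;
    if (cudaMallocManaged(&c, n * sizeof(float)) != cudaSuccess) return false;

    for (int i = 0; i < n; ++i) {
        a[i] = __float2half(1.0f);
        b[i] = __float2half(1.0f);
        c[i] = 0.0f;
    }

    wmma_tile_kernel<<<1, 32>>>(a, b, c);  // exactly one warp
    bool ok = (cudaDeviceSynchronize() == cudaSuccess);

    for (int i = 0; ok && i < n; ++i) {
        ok = (c[i] == 16.0f);  // each dot product sums sixteen 1*1 terms
    }

    cudaFree(a);
    cudaFree(b);
    cudaFree(c);
    return ok;
}
```

Calling `run_wmma_tile_check()` from a `main` function exercises the kernel. A full HGEMM would tile larger matrices over many warps and stage data through shared memory, which this sketch deliberately leaves out.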
cuda_hgemv
48 stars, 4 forks
Several optimization methods for half-precision general matrix-vector multiplication (HGEMV) on CUDA cores.
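As an illustration of what an HGEMV kernel on CUDA cores can look like, the sketch below assigns one warp to each row of a row-major matrix and combines the per-lane partial sums with warp shuffles. It is one common approach and is not taken from the cuda_hgemv repository: the names `hgemv_warp_per_row` and `run_hgemv_check` are hypothetical, and the sketch assumes a GPU with managed-memory support and a block size that is a multiple of 32.

```cuda
#include <cuda_fp16.h>
#include <cuda_runtime.h>

// y = A * x, where A is m x k (row-major, FP16), x has k entries, y has m entries.
// Each warp handles one row; products are accumulated in FP32.
__global__ void hgemv_warp_per_row(const half* a, const half* x, half* y,
                                   int m, int k) {
    const int row = (blockIdx.x * blockDim.x + threadIdx.x) / 32;
    const int lane = threadIdx.x % 32;
    if (row >= m) return;  // row is uniform within a warp, so whole warps exit together

    float sum = 0.0f;
    for (int i = lane; i < k; i += 32) {
        sum += __half2float(a[row * k + i]) * __half2float(x[i]);
    }

    // Tree reduction across the 32 lanes; lane 0 ends up with the full sum.
    for (int offset = 16; offset > 0; offset >>= 1) {
        sum += __shfl_down_sync(0xffffffffu, sum, offset);
    }

    if (lane == 0) y[row] = __float2half(sum);
}

// Hypothetical helper (an assumption, not part of any library): A is all ones,
// x is all 0.5, so each output should be k * 0.5 = 32 for k = 64.
bool run_hgemv_check() {
    const int m = 4;
    const int k = 64;
    half* a = nullptr;
    half* x = nullptr;
    half* y = nullptr;
    if (cudaMallocManaged(&a, m * k * sizeof(half)) != cudaSuccess) return false;
    if (cudaMallocManaged(&x, k * sizeof(half)) != cudaSuccess) return false;
    if (cudaMallocManaged(&y, m * sizeof(half)) != cudaSuccess) return false;

    for (int i = 0; i < m * k; ++i) a[i] = __float2half(1.0f);
    for (int i = 0; i < k; ++i) x[i] = __float2half(0.5f);
    for (int i = 0; i < m; ++i) y[i] = __float2half(0.0f);

    const int threads = 128;                            // four warps per block
    const int blocks = (m * 32 + threads - 1) / threads;
    hgemv_warp_per_row<<<blocks, threads>>>(a, x, y, m, k);
    bool ok = (cudaDeviceSynchronize() == cudaSuccess);

    for (int r = 0; ok && r < m; ++r) {
        ok = (__half2float(y[r]) == 32.0f);  // 32 is exactly representable in FP16
    }

    cudaFree(a);
    cudaFree(x);
    cudaFree(y);
    return ok;
}
```

Calling `run_hgemv_check()` from a `main` function exercises the kernel. Accumulating in FP32 is a design choice made here to limit rounding error; further optimizations such as vectorized `half2` loads are left out to keep the sketch short.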