gemm topic

List gemm repositories

CLBlast

1.0k
Stars
202
Forks
Watchers

Tuned OpenCL BLAS

CTranslate2

2.9k
Stars
259
Forks
Watchers

Fast inference engine for Transformer models

laser

263
Stars
15
Forks
Watchers

The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats a...

blislab

442
Stars
95
Forks
Watchers

BLISlab: A Sandbox for Optimizing GEMM

Tensile

195
Stars
136
Forks
Watchers

Stretching GPU performance for GEMMs and tensor contractions.

slibs

111
Stars
11
Forks
Watchers

Single file libraries for C/C++

Optimizing-SGEMM-on-NVIDIA-Turing-GPUs

264
Stars
43
Forks
Watchers

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

dbcsr

132
Stars
45
Forks
Watchers

DBCSR: Distributed Block Compressed Sparse Row matrix library

cublasgemm-benchmark

28
Stars
16
Forks
Watchers

code for benchmarking GPU performance based on cublasSgemm and cublasHgemm