cublas topic

List cublas repositories

bandicoot-code

Stars: 24 · Forks: 5

Bandicoot: C++ library for GPU linear algebra & scientific computing - https://coot.sourceforge.io

awesome-cuda-and-hpc

Stars: 134 · Forks: 16

🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, TensorRT and High Performance Computing (HPC) projects.

cuda_hgemm

Stars: 270 · Forks: 62

Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores with the WMMA API and MMA PTX instructions.

cuda_hook

Stars: 129 · Forks: 33

Hooks CUDA-related dynamic libraries using automated code generation tools.

DSAbeamformer

Stars: 17 · Forks: 6

Real-time GPU Beamformer for DSA110 written in C/CUDA

Tiled-MM

Stars: 22 · Forks: 7

Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.

caffe-escoin

Stars: 15 · Forks: 2

Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs

mkl-verbose-toolkit

Stars: 15 · Forks: 3

Tools to run and parse MKL verbose mode

cuda_hgemv

Stars: 48 · Forks: 4

Several optimization methods for half-precision general matrix-vector multiplication (HGEMV) using CUDA cores.

cuda-beginner-course-cpp-version

Stars: 24 · Forks: 3

Companion code for the bilibili video course "Introduction to CUDA 12.x Parallel Programming (C++ Edition)"