cublas topic

List cublas repositories
trafficstars

computeWorks_examples

5
Stars
1
Forks
Watchers

Matrix multiplication example performed with OpenMP, OpenACC, BLAS, cuBLABS, and CUDA

nvml_examples

20
Stars
1
Forks
Watchers

Examples showing how to utilize the NVML library for GPU monitoring

cupy

9.0k
Stars
829
Forks
Watchers

NumPy & SciPy for GPU

scikit-cuda

973
Stars
179
Forks
Watchers

Python interface to GPU-powered libraries

deeppipe2

92
Stars
5
Forks
Watchers

Deep Learning library using GPU(CUDA/cuBLAS)

cuda-swift

81
Stars
8
Forks
Watchers

Parallel Computing Library for Linux and macOS & NVIDIA CUDA Wrapper

cublasgemm-benchmark

28
Stars
16
Forks
Watchers

code for benchmarking GPU performance based on cublasSgemm and cublasHgemm

learn-gpgpu

51
Stars
15
Forks
Watchers

Algorithms implemented in CUDA + resources about GPGPU

cublasHgemm-P100

34
Stars
13
Forks
Watchers

Code for testing the native float16 matrix multiplication performance on Tesla P100 and V100 GPU based on cublasHgemm

cudarc

439
Stars
59
Forks
Watchers

Safe rust wrapper around CUDA toolkit