CUDA topic

CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphical processing units (GPUs). With CUDA, developers are able to dramatically speed up computing applications by harnessing the power of GPUs.

List CUDA repositories

how-to-optimize-gemm

583
Stars
78
Forks
Watchers

row-major matmul optimization

kaldi

13.8k
Stars
5.3k
Forks
Watchers

kaldi-asr/kaldi is the official location of the Kaldi project.

catboost

7.8k
Stars
1.2k
Forks
Watchers

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...

ngp_pl

1.2k
Stars
151
Forks
Watchers

Instant-ngp in pytorch+cuda trained with pytorch-lightning (high quality with high speed, with only few lines of legible code)

thrust

4.9k
Stars
758
Forks
Watchers

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

libcudacxx

2.3k
Stars
188
Forks
Watchers

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

cutlass

8.0k
Stars
1.3k
Forks
Watchers

CUDA Templates for Linear Algebra Subroutines

GPU-Raytracer

773
Stars
43
Forks
Watchers

GPU Raytracer from scratch in C++/CUDA

plotoptix

479
Stars
26
Forks
Watchers

Data visualisation and ray tracing in Python based on OptiX 7.7 framework.