CUDA topic
CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphics processing units (GPUs). With CUDA, developers can dramatically speed up computing applications by harnessing the power of GPUs.
how-to-optimize-gemm
row-major matmul optimization
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
catboost
A fast, scalable, high-performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computa...
ngp_pl
Instant-ngp in PyTorch+CUDA, trained with pytorch-lightning (high quality with high speed, with only a few lines of legible code)
thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
libcudacxx
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
cutlass
CUDA Templates for Linear Algebra Subroutines
LuxCore
LuxCore source repository
GPU-Raytracer
GPU Raytracer from scratch in C++/CUDA
plotoptix
Data visualisation and ray tracing in Python based on OptiX 7.7 framework.