CUDA topic
CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphical processing units (GPUs). With CUDA, developers are able to dramatically speed up computing applications by harnessing the power of GPUs.
hemi
Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
mini-nbody
A simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
nvml-wrapper
Safe Rust wrapper for the NVIDIA Management Library
jetson
Helmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlo...
cuda-design-patterns
Some CUDA design patterns and a bit of template magic for CUDA
CLTune
CLTune: An automatic OpenCL & CUDA kernel tuner
parallel-convolution
🖼️ Parallel Image Convolution, applying a blur filter to images. Written in C, optimized in three different ways: MPI, MPI & OpenMP and CUDA.
cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs