rocm topic
quokka
Two-moment AMR radiation hydrodynamics (with self-gravity, particles, and chemistry) on CPUs/GPUs for astrophysics
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
awesome-cuda-and-hpc
🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, TensorRT and High Performance Computing (HPC) projects.
spla
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.
automatic1111-webui-nix
AUTOMATIC1111/stable-diffusion-webui for CUDA and ROCm on NixOS
Tiled-MM
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
RET
ROCm Machine Learning and HPC Stack installer
rocSOLVER
Next generation LAPACK implementation for ROCm platform