rocm topic
List
rocm repositories
tvm
11.3k
Stars
3.4k
Forks
Watchers
Open deep learning compiler stack for cpu, gpu and specialized accelerators
stdgpu
1.1k
Stars
78
Forks
Watchers
stdgpu: Efficient STL-like Data Structures on the GPU
alpaka
328
Stars
67
Forks
Watchers
Abstraction Library for Parallel Kernel Acceleration :llama:
amdovx-core
149
Stars
53
Forks
Watchers
AMD OpenVX Core -- a sub-module of amdovx-modules:
nsimd
316
Stars
29
Forks
Watchers
Agenium Scale vectorization library for CPUs and GPUs
COSMA
179
Stars
27
Forks
Watchers
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
gpufort
157
Stars
14
Forks
Watchers
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify