chained-scan-with-decoupled-lookback topic
List
chained-scan-with-decoupled-lookback repositories
GPUPrefixSums
77
Stars
5
Forks
Watchers
A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.