chained-scan-with-decoupled-lookback topic


GPUPrefixSums

77 stars, 5 forks

A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.