taichi_benchmark Parallel scan performance gap between Vulkan and CUDA

Parallel scan performance gap between Vulkan and CUDA

Open qiao-bo opened this issue 2 years ago • 0 comments

Currently we support warp-based parallel scan for Vulkan and CUDA. Lets use this issue to track some performance data:

ENV: RTX3080 with Driver 510. CUDA 11.6.

Jul 21 '22 06:07 qiao-bo