fss
fss copied to clipboard
PRG with CUDA parallelism and the new PRG API for that
We are developing a CUDA-accelerated PRG project myl7/fss-prg-cuda for this project, hoping further improving the performance.
Since the new PRG needs multiple blocks to be processed every time, a new PRG API is required.