Clemens Lutz

Results 2 issues of Clemens Lutz

Pascal and newer devices support cooperative groups. These groups enable kernels to, e.g., globally synchronize the grid without terminating the kernel. A special function `cuLaunchCooperativeKernel` was added to CUDA 9.0...

Pascal and newer devices support a shared memory size larger than 48 KiB per thread group. This is an opt-in feature that was introduced in CUDA 9.0 by specifying the...