Clemens Lutz
Results
2
issues of
Clemens Lutz
Pascal and newer devices support cooperative groups. These groups enable kernels to, e.g., globally synchronize the grid without terminating the kernel. A special function `cuLaunchCooperativeKernel` was added to CUDA 9.0...
Pascal and newer devices support a shared memory size larger than 48 KiB per thread group. This is an opt-in feature that was introduced in CUDA 9.0 by specifying the...