RAJAPerf
RAJAPerf copied to clipboard
Avoid block stride loop on AMD GPUs to increase performance for FEM kernels
It has been observed that performing block stride loops on AMD decreases performance, to increase performance use a direct mapping. Please see FEM kernels under apps.