RAJAPerf icon indicating copy to clipboard operation
RAJAPerf copied to clipboard

Avoid block stride loop on AMD GPUs to increase performance for FEM kernels

Open artv3 opened this issue 1 year ago • 2 comments

It has been observed that performing block stride loops on AMD decreases performance, to increase performance use a direct mapping. Please see FEM kernels under apps.

artv3 avatar Nov 25 '24 18:11 artv3