Arturo Vargas
This PR is a collaboration space for exploring optimizations of RAJA launch and its loop abstraction.
Certain backends are not supported; we should reconfigure the framework so that empty files are not required for building.
DRAFT PR. -- This PR adds the option to store thread-block info in the launch ctx, avoiding calls to blockDim.x/y/z inside the loop methods of RAJA launch.
Some of the kernels use `const Real_ptr`; we believe the usage should be `Real_const_ptr`, since `const Real_ptr` only makes the pointer itself const and does not protect the pointed-to data.
- [ ] Add an atomic variant for the mass PA kernel
- [ ] Update Diffusion kernel -- mfem version has been updated
- [ ] Update reference link...
Some kernels have been observed to use blockIdx.x while others use the templated block size. We should do a pass to ensure consistency, and consider different "tuning" versions if we want...
# Summary

This test modifies an existing kernel to use direct threading.
Block-stride loops have been observed to decrease performance on AMD; for better performance, use a direct mapping. Please see the FEM kernels under apps.