AMDGPU.jl
Implement occupancy estimator
We should be able to estimate how well a given kernel can occupy a given piece of hardware. With that estimate, `@roc groupsize=auto ...` could automatically select a groupsize whenever the choice of groupsize is irrelevant to the given kernel.
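A rough sketch of what such an estimator could look like, in plain Julia with no dependency on AMDGPU.jl. Everything here is an assumption for illustration: the `HardwareLimits` and `KernelUsage` types, the `estimate_occupancy` and `auto_groupsize` functions, and the default hardware numbers (GCN-like values, not queried from a real device) are hypothetical names and figures, not existing API. The model only considers VGPR and LDS pressure and treats a compute unit's wave slots as one shared pool, which is a simplification.

```julia
# Hypothetical per-compute-unit (CU) limits. The defaults are illustrative
# GCN-like values, NOT read from real hardware.
Base.@kwdef struct HardwareLimits
    wavefront_size::Int = 64
    simds_per_cu::Int = 4
    max_waves_per_simd::Int = 10
    vgprs_per_simd::Int = 256        # VGPRs available per lane on each SIMD
    lds_per_cu::Int = 64 * 1024      # bytes of LDS per CU
end

# Hypothetical per-kernel resource usage (would come from compiler metadata).
Base.@kwdef struct KernelUsage
    vgprs::Int                       # VGPRs used per work-item
    lds_per_group::Int = 0           # bytes of LDS used per workgroup
end

# Fraction of the CU's wave slots that can be resident for this groupsize.
function estimate_occupancy(hw::HardwareLimits, k::KernelUsage, groupsize::Int)
    waves_per_group = cld(groupsize, hw.wavefront_size)
    max_waves_cu = hw.max_waves_per_simd * hw.simds_per_cu

    # Register pressure caps the number of resident waves per SIMD.
    waves_per_simd = k.vgprs == 0 ? hw.max_waves_per_simd :
        min(hw.max_waves_per_simd, div(hw.vgprs_per_simd, k.vgprs))
    # Simplification: wave slots are pooled across the CU's SIMDs.
    groups_by_waves = div(waves_per_simd * hw.simds_per_cu, waves_per_group)

    # LDS usage caps the number of resident workgroups per CU.
    groups_by_lds = k.lds_per_group == 0 ? typemax(Int) :
        div(hw.lds_per_cu, k.lds_per_group)

    groups = min(groups_by_waves, groups_by_lds)
    return groups * waves_per_group / max_waves_cu
end

# Pick the candidate groupsize with the highest estimated occupancy
# (the first such candidate wins on ties).
function auto_groupsize(hw::HardwareLimits, k::KernelUsage;
                        candidates = (64, 128, 256, 512, 1024))
    best, best_occ = first(candidates), -1.0
    for g in candidates
        occ = estimate_occupancy(hw, k, g)
        if occ > best_occ
            best, best_occ = g, occ
        end
    end
    return best, best_occ
end

hw = HardwareLimits()
kernel = KernelUsage(vgprs = 24)
groupsize, occupancy = auto_groupsize(hw, kernel)
```

Something like `auto_groupsize` is what `groupsize=auto` could call under the hood, once the real register and LDS usage of the compiled kernel and the real device limits are available. A production estimator would also need to account for SGPRs and per-SIMD wave placement.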