AMDGPU.jl
Feature request: allow `hsa_amd_memory_copy_async` to pick a queue
This would expose an API similar to CUDA.jl's `CUDA.Mem.unsafe_copy3d!`.
Suggestion from @jpsamaroo: write a memcopy kernel and put it directly in AMDGPU.jl exposing an argument to specify which queue to use.
@jpsamaroo #220 may close this?
We now support this with the HIP version.