ucx
UCT/CUDA_IPC: Use active-queues to track outstanding work
What/Why?
Currently, the CUDA_IPC transport uses an integer stream_count to track outstanding work. In preparation for multi-device support, this PR switches to active_queue usage, similar to the cuda_copy transport. This should eventually also help unify the code shared between cuda_ipc and cuda_copy for stream/event usage. This PR also removes the max peer limitations.
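For reviewers unfamiliar with the pattern, here is a minimal sketch of active-queue tracking in plain C. It is not the code from this PR: `stream_queue_t`, `iface_t`, `queue_post`, `iface_progress`, and `iface_num_active` are hypothetical names, and the `done_upto` threshold is an assumed stand-in for a real event query such as `cudaEventQuery`. The idea is that each queue joins an active list when it first gets outstanding work and leaves it once drained, so progress only visits queues that actually have something pending.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_EVENTS 8

/* Hypothetical per-stream queue holding pending event ids in FIFO order. */
typedef struct stream_queue {
    int                  events[MAX_EVENTS]; /* ring buffer of event ids */
    size_t               head;               /* index of oldest event */
    size_t               count;              /* number of pending events */
    struct stream_queue *next_active;        /* link in the active list */
    int                  is_active;          /* nonzero while on the list */
} stream_queue_t;

/* Hypothetical interface: only tracks queues that have outstanding work. */
typedef struct {
    stream_queue_t *active_head;
} iface_t;

/* Record outstanding work; add the queue to the active list on first use. */
static int queue_post(iface_t *iface, stream_queue_t *q, int event)
{
    if (q->count == MAX_EVENTS) {
        return -1; /* no room; the caller would retry later */
    }
    q->events[(q->head + q->count) % MAX_EVENTS] = event;
    q->count++;
    if (!q->is_active) {
        q->is_active       = 1;
        q->next_active     = iface->active_head;
        iface->active_head = q;
    }
    return 0;
}

/* Visit only active queues. An event counts as complete when its id is
 * <= done_upto (assumed stand-in for an event query). Work on one stream
 * completes in order, so stop at the first incomplete event.
 * Returns the number of events completed. */
static unsigned iface_progress(iface_t *iface, int done_upto)
{
    unsigned         completed = 0;
    stream_queue_t **link      = &iface->active_head;

    while (*link != NULL) {
        stream_queue_t *q = *link;
        while ((q->count > 0) && (q->events[q->head] <= done_upto)) {
            q->head = (q->head + 1) % MAX_EVENTS;
            q->count--;
            completed++;
        }
        if (q->count == 0) {
            *link          = q->next_active; /* drained: unlink the queue */
            q->is_active   = 0;
            q->next_active = NULL;
        } else {
            link = &q->next_active;
        }
    }
    return completed;
}

/* Count queues that still have outstanding work. */
static size_t iface_num_active(const iface_t *iface)
{
    size_t                n = 0;
    const stream_queue_t *q;

    for (q = iface->active_head; q != NULL; q = q->next_active) {
        n++;
    }
    return n;
}
```

Compared with a single integer counter, the list records which queues have pending work rather than only how many, which is the property that matters once queues can belong to different devices.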
/azp run UCX PR
Azure Pipelines successfully started running 1 pipeline(s).
/azp run
Azure Pipelines successfully started running 4 pipeline(s).