ucx icon indicating copy to clipboard operation
ucx copied to clipboard

UCT/CUDA_COPY: detect device transfers and report peak arch bandwidth

Open Akshay-Venkatesh opened this issue 2 years ago • 1 comments

What

Detect if remote/local memory types for perf estimate is of type cuda/cuda-managed. If so, report peak device memory bandwidth

Why ?

Preparation for device staging pipeline protocols. Without this patch, only estimated peak host<->cuda bandwidth is reported which may not allow for device bounce buffer selection.

Akshay-Venkatesh avatar Mar 10 '22 19:03 Akshay-Venkatesh

cc @yosefe @bureddy

Akshay-Venkatesh avatar Mar 10 '22 19:03 Akshay-Venkatesh