gdrcopy
gdrcopy copied to clipboard
Add NVTX instrumentation
Depending on how often communication buffers are reused in applications the registration costs with GDRCopy, e.g. gdr_pin_buffer
and gdr_map
, can be significant. To allow identifying these situations with profiler tools like NSight Systems it would be very helpful if these calls are instrumented with NVTX.