nvbench icon indicating copy to clipboard operation
nvbench copied to clipboard

Profile only the kernels involved in the benchmark

Open fbusato opened this issue 9 months ago • 0 comments

NVBench currently allows kernel profiling with external tools with the flag --profile. On the other hand, profiling tools collect all activities in the benchmark, including "setup/initialization" kernels that are not part of the section under evaluation. NVBench should add cudaProfilerStart() and cudaProfilerStop() in profiling mode (--profile).

CUDA Runtime API reference

fbusato avatar May 21 '25 17:05 fbusato