nvbench icon indicating copy to clipboard operation
nvbench copied to clipboard

Async benchmarks always deadlock

Open gevtushenko opened this issue 2 years ago • 1 comments

The recent switch to lazy loading by default in CTK 12.2 seems to have broken the async benchmarks. This can be reproduced by nvbench.example.axes. The deadlock can be fixed by CUDA_MODULE_LOADING=EAGER. We should incorporate this information into the error message or set the variable ourselves.

gevtushenko avatar Jun 30 '23 17:06 gevtushenko

We likely want eager loads by default anyway to make sure that lazy loads aren't affecting measurements. Let's look into defining that var from the NVBench main implementation.

alliepiper avatar Aug 08 '23 13:08 alliepiper