kineto
CUDA Dynamic Parallelism Launches are Invisible
As the title states, we've noticed that kernels using dynamic parallelism do not show up in the profiler. It would be a nice quality-of-life change if they did.
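For reference, here is a minimal sketch of the kind of launch in question. It is not the code we originally hit this with; the kernel names (`parent_kernel`, `child_kernel`) and sizes are made up for illustration. The parent kernel is launched from the host, and it launches the child kernel from device code, which is the dynamic parallelism launch that we would expect to be missing from the profiler.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical child kernel: launched from device code (dynamic parallelism).
__global__ void child_kernel(int *out) {
    out[threadIdx.x] = threadIdx.x;
}

// Hypothetical parent kernel: launched from the host.
// A single thread launches the child grid from the device.
__global__ void parent_kernel(int *out) {
    if (blockIdx.x == 0 && threadIdx.x == 0) {
        child_kernel<<<1, 32>>>(out);
    }
}

int main() {
    int *d_out = nullptr;
    cudaMalloc(&d_out, 32 * sizeof(int));

    // Host-side launch of the parent kernel.
    parent_kernel<<<1, 1>>>(d_out);

    // The parent grid is not complete until its child grid has finished,
    // so this also waits for child_kernel.
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        std::printf("CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    int h_out[32];
    cudaMemcpy(h_out, d_out, sizeof(h_out), cudaMemcpyDeviceToHost);
    std::printf("h_out[31] = %d\n", h_out[31]);

    cudaFree(d_out);
    return 0;
}
```

Dynamic parallelism needs relocatable device code and the device runtime library, so this should build with something like `nvcc -rdc=true repro.cu -o repro -lcudadevrt` (the file name is arbitrary, and an `-arch` flag may be needed depending on the GPU).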