kineto icon indicating copy to clipboard operation
kineto copied to clipboard

Visualizing pytorch Emited NVTX with Tensorboard Profiler

Open jmoork opened this issue 2 years ago • 1 comments

There are important Pytorch Specific markers generated during the training loop either using NVTX or other means from pytorch lightning.

It would be useful to show the time line view with execution time for different pytorch training sections of the code as shown in the attached image. The current trace view is quite detailed and goes down one-step further with detailed cuda kernels etc. It would be useful to have Pytorch Execution specific trace view that is easy to understand and intuit to find any synchronization or communication bottlenecks in the training loop and also compare different execution time for different executing region of the training/validation code (like forward, loss, data-loading etc).

Screenshot 2023-07-19 at 5 05 22 PM

jmoork avatar Jul 20 '23 00:07 jmoork

Hello, is NVTX captured by kineto atm?

jon-chuang avatar Mar 27 '24 10:03 jon-chuang