cutlass icon indicating copy to clipboard operation
cutlass copied to clipboard

[QST] [CuTeDSL] Nsight Compute Profiler Link to Source Code

Open HanGuo97 opened this issue 5 months ago • 8 comments

What is your question?

When profiling CUDA/CUTLASS, the profiler can provide line-by-line profiling for user code, in addition to PTX and SASS. Triton can also do this, likely because its compiler tracks source locations. I believe CuTeDSL has a similar feature since it tracks source locations too. However, I’m unsure how to enable this, as the default ncu output only shows SASS. Do you happen to know how to enable detailed profiling if it’s possible?

HanGuo97 avatar Jul 10 '25 19:07 HanGuo97

Good suggestion! It's a very useful feature that we are considering to add ( ETA is TBD ).

fengxie avatar Jul 11 '25 07:07 fengxie

Just want to echo this, would make it much easier than just reading the SASS

tridao avatar Jul 11 '25 15:07 tridao

@brandon-yujie-sun

fengxie avatar Jul 11 '25 15:07 fengxie

I would also very much like this. I think the lack of the source-mapping also makes debugging IMAs with compute-sanitizer much harder as well.

Chillee avatar Jul 30 '25 22:07 Chillee

Thanks for all the inputs. Investigation is in progress.

brandon-yujie-sun avatar Aug 01 '25 02:08 brandon-yujie-sun

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

github-actions[bot] avatar Aug 31 '25 03:08 github-actions[bot]

Adding that this is perhaps a dealbreaker for our use case of CuTe DSL, profiling a kernel is highly important during high performance kernel development.

odelame avatar Oct 12 '25 10:10 odelame

Folks, 4.3 dev added source location tracking for DSL APIs which enables the source code correlation for DSL codes with the profiling and debugging. Please let us know if you see any issues with that.

brandon-yujie-sun avatar Oct 22 '25 07:10 brandon-yujie-sun