sraikund16 issues

Results 13 issues of


                                            sraikund16

Add Sanity Testing to Pytorch Profiler

Summary: In the recent weeks, we have encountered bugs in both the normal synchronous trace and on-demand tracing. This diff on its own does sanity checking to make sure the...

fb-exported

ciflow/trunk

release notes: profiler

topic: not user facing

ciflow/rocm

Fix torch.profiler Schedule Function (Function Event only)

Summary: github issue: https://github.com/pytorch/pytorch/issues/73828 Whenever we transition from record and save to warmup, we instantiate a new backend profiler which wipes out the last cycles information. We should keep the...

fb-exported

ciflow/trunk

ciflow/periodic

ciflow/rocm

Make Kineto Stage Log Level Lower

Summary: Users have complained that STAGE level logs are print by default. If a user is running many profiles it can certainly clutter STDOUT. Lower the STAGE level such that...

cla signed

Remove MAX_STACK_ENTRY from _build_table

Summary: As reported by this issue: https://github.com/pytorch/pytorch/issues/83584 We already store the entries in evt.stack so there is no need to cap the limit when we output the table to 5...

fb-exported

Add Logging for Empty Traces

Summary: Recently, we have had users seen empty traces when the system is idle leading to confusion as to whether it was caused by a bug in kineto formatting or...

fb-exported

cla signed

Upgrade to CUDA 12.4 is causing segfaults in 4 Range Profiler Tests

After upgrading the CUDA image to 12.4 we are having segfault failures in the following tests: 19 - CuptiRangeProfilerApiTest.asyncLaunchUserRange (SEGFAULT) 20 - CuptiRangeProfilerApiTest.asyncLaunchAutoRange (SEGFAULT) 24 - CuptiRangeProfilerTest.UserRangeTest (SEGFAULT) 25 -...

Align Roctracer to TSC Clock

Summary: Right now we align Roctracer events to system clock blindly regardless of what we are using in torch.profiler. We should use a clock based on what is defined instead....

fb-exported

cla signed

Roctracer events show up out of order

Summary: Add debug to show that some events in roctracer can start before the previous one ends Differential Revision: D63033163

fb-exported

cla signed

[DO NOT MERGE] add debug for roctracer rccl events

Differential Revision: D61943373

fb-exported

cla signed

[Issue]: Roctracer reports GPU Events Ending at Same Time Next Event starts

### Problem Description We notice that many of the events in Roctracer for a single GPU and single queue have a "tie". The first event ends at the exact same...