roctracer icon indicating copy to clipboard operation
roctracer copied to clipboard

[Issue]: Roctracer reports GPU Events Ending at Same Time Next Event starts

Open sraikund16 opened this issue 5 months ago • 0 comments

Problem Description

We notice that many of the events in Roctracer for a single GPU and single queue have a "tie". The first event ends at the exact same nanosecond the second one starts. This is a fairly innocuous bug but can skew kernel metrics if the times are not being reported correctly. Ideally there would be some buffer of nanoseconds between an event end and event start.

This seems to be a different problem than https://github.com/ROCm/roctracer/issues/104 as it seems to be some issue with granularity rather than mismatched timings.

Operating System

CentOS Stream 9

CPU

AMD EPYC 7713

GPU

AMD Instinct MI300X

ROCm Version

ROCm 6.2.0

ROCm Component

rocm-core, roctracer

Steps to Reproduce

No response

(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

No response

Additional Information

No response

sraikund16 avatar Sep 23 '24 23:09 sraikund16