wrong number of cycles for V_SQRT_F32

Open • pixelmager opened this issue 6 years ago • 1 comment

V_SQRT_F32 should be 16 cycles, not 4, the same as V_RSQ_F32: https://github.com/GPUOpen-Tools/RGA/blob/f3f9539759c22e1a01bf184397ff089e95638d86/RadeonGPUAnalyzerBackend/Emulator/Parser/VOPInstruction.h#L180

Maybe check that the other instructions marked with "// guess" are accurate?

pixelmager, Jul 25 '19 13:07

Thanks for reporting this. We need to update these estimates to reflect some architectural changes that were introduced in RDNA. We will do our best to address this in one of our upcoming releases.

AmitBM, Jul 29 '19 18:07