radeon_gpu_analyzer
Wrong number of cycles for V_SQRT_F32
V_SQRT_F32 should be 16 cycles, not 4, the same as V_RSQ_F32. See https://github.com/GPUOpen-Tools/RGA/blob/f3f9539759c22e1a01bf184397ff089e95638d86/RadeonGPUAnalyzerBackend/Emulator/Parser/VOPInstruction.h#L180
Maybe check that the other instructions marked with "// guess" are accurate?
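To illustrate the suggested change, here is a minimal sketch of a cycle-estimate lookup in which V_SQRT_F32 shares the 16-cycle estimate with V_RSQ_F32. The names `Op`, `kQuarterRateCycles`, and `estimatedCycles` are hypothetical and are not taken from VOPInstruction.h. The value 16 comes from this report and has not been checked against hardware documentation.

```cpp
// Hypothetical sketch: not RGA's actual code or data layout.
enum class Op { V_RSQ_F32, V_SQRT_F32 };

// Assumption from this report: both instructions should be estimated at 16 cycles.
constexpr int kQuarterRateCycles = 16;

// Returns the estimated cycle count for the given instruction.
constexpr int estimatedCycles(Op op) {
    return (op == Op::V_RSQ_F32 || op == Op::V_SQRT_F32)
               ? kQuarterRateCycles
               : 4;  // placeholder default for anything else in this sketch
}
```

Routing both opcodes through one shared constant keeps the two estimates from drifting apart if the value is revised later.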
Thanks for reporting this. We need to update these estimates to reflect some architectural changes that were introduced in RDNA. We will do our best to get to it in one of our upcoming releases.