benchmark
benchmark copied to clipboard
TFLOPS calculation for TF32
The current TFLOPS is only for FP32. Need to add support for other floating point formats such as TF32 and FP16.