Moyang Wang
Moyang Wang
There is no native half-precision support on NVIDIA Ampere (except for A100) or Ada GPU. Their half-precision performance is the same as single-precision.
@BA8F0D39 This seems to be a problem with NVIDIA's OpenCL implementation. When querying device capabilities by apps (like clpeak), it reports "no half-precision support". I observed the same issue with...
I guess it means no "native support" for half-precision. FWIW Ada emulates FP16 using FP32, hence FP16 and FP32 have the same tflops. In contrast, recent AMD architectures (RDNA, CDNA)...