AMDMIGraphX
[Q] Is nearbyint necessary for FP8 quantizelinear?
For integer quantization, the quantizelinear operation is
nearbyint(x / scale) + zeropoint.
FP8 is already a floating-point type. Is nearbyint necessary in that case?
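
For reference, the integer case can be sketched in Python as below. This is only an illustration of the formula above, not MIGraphX code: the function name `quantize_linear_int8` and the int8 saturation range are assumptions, and Python's built-in `round` (round half to even) stands in for `nearbyint` under the default rounding mode.

```python
def quantize_linear_int8(x, scale, zero_point):
    """Illustrative integer quantization: nearbyint(x / scale) + zeropoint.

    Hypothetical helper, not part of MIGraphX. Saturating to the int8
    range [-128, 127] is an assumption added for this sketch.
    """
    # Python 3's round() rounds ties to the nearest even integer, which is
    # what nearbyint does under the default round-to-nearest mode.
    q = round(x / scale) + zero_point
    # Clamp so the result fits in int8 (assumed target type).
    return max(-128, min(127, q))


if __name__ == "__main__":
    for x in (1.0, 1.25, 1.75, 1000.0):
        print(x, quantize_linear_int8(x, scale=0.5, zero_point=0))
```

The rounding step is what maps the real-valued quotient onto the integer grid here; the question is whether an equivalent explicit step is needed when the target type is FP8.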