Silver Yu
Does this mean that intermediate values are still stored in 16-bit precision? If so, does this imply that W8A8 quantization doesn’t actually reduce peak memory usage? In my project, I...