Silver Yu

Does this mean that intermediate values are still stored in 16-bit precision? If so, does this imply that W8A8 quantization doesn’t actually reduce peak memory usage? In my project, I...