distiller
Quantization to more than 8 bits not working properly?
Thanks for this great framework! Is there an explicit restriction or a known limitation on quantizing weights and/or activations to more than 8 bits with the asymmetric methods? When I tried 16 or 32 bits for weights with asymetric_s (and similarly for activations), accuracy dropped to 0.2%, whereas I expected it to improve.
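To show what I mean by "should improve", here is a minimal sketch of asymmetric range-based linear quantization in plain Python. It is not Distiller's code: the function names `asymmetric_quantize` and `max_abs_error`, the rounding scheme, and the random test weights are my own assumptions for illustration. It runs in double precision and does not reproduce the accuracy drop; it only shows that the round-trip error would be expected to shrink as the bit width grows.

```python
import random

def asymmetric_quantize(values, num_bits):
    """Quantize to num_bits with an asymmetric range, then dequantize.

    Hypothetical helper for illustration; not taken from Distiller.
    """
    lo, hi = min(values), max(values)
    levels = 2 ** num_bits - 1                 # highest integer level
    scale = levels / (hi - lo) if hi > lo else 1.0
    zero_point = round(scale * lo)             # integer offset for the range minimum
    dequantized = []
    for v in values:
        q = round(scale * v) - zero_point      # map into [0, levels]
        q = max(0, min(levels, q))             # clamp to the representable range
        dequantized.append((q + zero_point) / scale)
    return dequantized

def max_abs_error(values, num_bits):
    """Largest round-trip error over all values."""
    restored = asymmetric_quantize(values, num_bits)
    return max(abs(a - b) for a, b in zip(values, restored))

random.seed(0)
# Assumed stand-in for a weight tensor with an asymmetric value range.
weights = [random.uniform(-0.5, 1.5) for _ in range(1000)]

errors = {bits: max_abs_error(weights, bits) for bits in (4, 8, 16, 32)}
for bits, err in errors.items():
    print(f"{bits:2d} bits: max abs error = {err:.3e}")
```

In this sketch the error falls with every increase in bit width, which is why a drop to 0.2% accuracy at 16 or 32 bits looks wrong to me. A real implementation that stores values in float32 or fixed-width integer tensors may behave differently at these widths, but I have not verified whether that is the cause here.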