auto-round
auto-round copied to clipboard
[New Feature Request] layer wise sensitivity analysis
Focus on WA quantization of MX_FP first refer to https://github.com/intel/neural-compressor/blob/master/neural_compressor/strategy/bayesian.py https://github.com/pytorch/ao/tree/main/torchao/quantization/prototype/mixed_precision