auto-round [New Feature Request] layer wise sensitivity analysis

Open wenhuach21 opened this issue 5 months ago • 0 comments

Focus on weight and activation (WA) quantization of MX_FP first. For reference, see:

https://github.com/intel/neural-compressor/blob/master/neural_compressor/strategy/bayesian.py
https://github.com/pytorch/ao/tree/main/torchao/quantization/prototype/mixed_precision
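To make the request concrete, here is a rough sketch of one common way to score layer-wise sensitivity: quantize one layer at a time (weights and input activations), keep the rest in full precision, and measure the output error on calibration data. This is not auto-round code. The helper names (`fake_quant`, `layer_sensitivity`), the toy ReLU MLP, and the MSE metric are all assumptions for illustration, and a plain symmetric integer grid stands in for MX_FP, which is not implemented here.

```python
import random


def fake_quant(values, bits=4):
    """Symmetric per-tensor fake quantization (a stand-in for MX_FP, not MX_FP itself)."""
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return list(values)
    qmax = 2 ** (bits - 1) - 1
    scale = max_abs / qmax
    # Round to the integer grid, clamp, then dequantize back to float.
    return [max(-qmax, min(qmax, round(v / scale))) * scale for v in values]


def linear(weight, x, quantize=False, bits=4):
    """y = W @ x; optionally fake-quantize W (per output row) and x (WA quantization)."""
    if quantize:
        weight = [fake_quant(row, bits) for row in weight]
        x = fake_quant(x, bits)
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]


def forward(layers, x, quant_layer=None, bits=4):
    """Run a toy ReLU MLP, quantizing only the layer at index `quant_layer`."""
    for i, weight in enumerate(layers):
        x = linear(weight, x, quantize=(i == quant_layer), bits=bits)
        if i < len(layers) - 1:
            x = [max(0.0, v) for v in x]
    return x


def layer_sensitivity(layers, calib_inputs, bits=4):
    """Mean squared output error caused by quantizing each layer on its own."""
    scores = []
    for i in range(len(layers)):
        total, count = 0.0, 0
        for x in calib_inputs:
            ref = forward(layers, x)  # full-precision reference output
            out = forward(layers, x, quant_layer=i, bits=bits)
            total += sum((a - b) ** 2 for a, b in zip(ref, out))
            count += len(ref)
        scores.append(total / count)
    return scores


# Toy model and calibration data (random, purely illustrative).
rng = random.Random(0)
dims = [8, 16, 16, 4]
layers = [
    [[rng.gauss(0, 1) for _ in range(dims[i])] for _ in range(dims[i + 1])]
    for i in range(len(dims) - 1)
]
calib = [[rng.gauss(0, 1) for _ in range(dims[0])] for _ in range(16)]

scores = layer_sensitivity(layers, calib, bits=4)
# Most sensitive layers first: candidates to keep at higher precision.
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
print(scores, ranking)
```

The resulting ranking could feed a mixed-precision search such as the ones in the links above, but that step is left out of this sketch.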

Sep 11 '24 06:09 wenhuach21
