Wenhua Cheng
Results
24
issues of
Wenhua Cheng
Currently, we need to support each model individually, and the code lacks cohesion and uniformity.
Focus on WA quantization of MX_FP first refer to https://github.com/intel/neural-compressor/blob/master/neural_compressor/strategy/bayesian.py https://github.com/pytorch/ao/tree/main/torchao/quantization/prototype/mixed_precision
https://github.com/pytorch/ao/pull/870