auto-round
auto-round copied to clipboard
WIP: fix compat with latest autogptq and use meta region to store auto-round properties
Reason for PR:
- Fix compat with latest autogptq
- Store autoround fingerprint/version using
meta_set_quantizer(name, version)
api - Store autoround specific parameters, unrelated to actual autogptq inference/quantization, into meta region via
meta_set
api - add tqdm progress to quantization so user get good estimation of iter/s and remaining time
Pending merge/changes to https://github.com/AutoGPTQ/AutoGPTQ/pull/640
- [x] Tested quant with autogptq inference with sym=True and False.