quanto
quanto copied to clipboard
Does AWQ is officially supported now?
I can see that optimum-quanto provides several external (weight-only) quantization algorithm such as smoothquant and awq in here.
It looks like smoothquant only supports OPT models, and awq is still under development. Do you have any further development plans for AWQ?