mistral.rs
Are there any plans to support AWQ and GPTQ in the future?
Support AWQ and GPTQ quantization.
Hi @franklucky001! I do plan on supporting other quantization methods such as AWQ and GPTQ in the future. I would probably start with GPTQ.
@franklucky001 we are going to merge GPTQ soon!
@franklucky001 GPTQ and Marlin support have been merged already.
So does AWQ work? For some reason I could've sworn it was a form of GPTQ, so I figure there's a possibility. Wish I had an AWQ model downloaded to try.