llm-awq
GGUF export support / CPU inference
Hi, are there any plans to add support for GGUF export for CPU inference? Or is there any other way to run inference with an AWQ-quantized model on CPU?
Thanks, Tomek