llm-awq icon indicating copy to clipboard operation
llm-awq copied to clipboard

GGUF export support / CPU inference

Open TomekPro opened this issue 6 months ago • 0 comments

Hi, are the any plans to add a support for GGUF export for CPU inference? Or is there any other way to inference AWQ quantized model on CPU?

Thanks Tomek

TomekPro avatar Aug 05 '24 09:08 TomekPro