exllama
exllama copied to clipboard
When will the bfloat16 type of GPTQ algorithm be supported?