exllama icon indicating copy to clipboard operation
exllama copied to clipboard

Support non-Llama architectures

Open dred0n opened this issue 2 years ago • 0 comments

exLlama saved GPTQ, I've gone from 6 token/s to over 40, thank you! Currently it's only supports Llama based models.

Here's a few other promising architectures such as: MPT Falcon SalesForce StarCoder ChatGPT

Are there plans to support these other architectures?

dred0n avatar Jul 05 '23 17:07 dred0n