AutoAWQ icon indicating copy to clipboard operation
AutoAWQ copied to clipboard

[Feature] Support AWQ quantization for HF AutoModel (embedding models)

Open abhinavkulkarni opened this issue 1 year ago • 1 comments

Hi,

It should be possible to add AWQ quantization support for HuggingFace AutoModel which are used for generating embeddings.

Currently, AWQ support works for models of the type AutoModelForCausalLM.

Thanks!

abhinavkulkarni avatar Feb 12 '24 04:02 abhinavkulkarni

Hi @abhinavkulkarni, I would love to support more models - especially embedding models. If you are fortunate enough to have time to add support for these models, I would highly appreciate it

casper-hansen avatar Feb 16 '24 09:02 casper-hansen