aphrodite-engine icon indicating copy to clipboard operation
aphrodite-engine copied to clipboard

[Feature]: add back exl2 support?

Open joshuakoh1 opened this issue 1 year ago • 2 comments

🚀 The feature, motivation and pitch

Why was exl2 support dropped?

Is there anything that the community can help with that is stuck?

Alternatives

No response

Additional context

No response

joshuakoh1 avatar Dec 12 '24 08:12 joshuakoh1

+1, happy to help in any way I can

discordianbelle avatar Dec 13 '24 02:12 discordianbelle

The primary culprit was the upstream PR vllm-project/vllm#3977, which drastically changed how quantized layers were handled. This made working with exllamav2 extremely difficult. If someone can make the existing exl2 quantization work with the changes from that PR, it should be easier to manage.

AlpinDale avatar Dec 14 '24 22:12 AlpinDale