vllm

Published 20 hours ago •


[Feature]: How to run the int4 quantized version of the gemma2-27b model

Open • maxin9966 opened this issue 6 months ago • 5 comments

🚀 The feature, motivation and pitch

How can I run the int4-quantized version of the gemma2-27b model with vLLM?

Alternatives

No response

Additional context

No response

Aug 04 '24 13:08 maxin9966