vllm
vllm copied to clipboard
[Feature]: How to run the int4 quantized version of the gemma2-27b model
🚀 The feature, motivation and pitch
How to run the int4 quantized version of the gemma2-27b model
Alternatives
No response
Additional context
No response