MiniGPT-4
GPTQ-quantized version of Vicuna?
Can we use a GPTQ-quantized version of Vicuna v0 as the backbone?
First, thanks for pointing us to the GPTQ-quantized version! We haven't tested it yet. We will look into it once we have time. Thank you!
I got it running with a 4-bit GPTQ-quantized model, and it works fine. Generation is significantly faster, but I can observe some loss of performance.
Could you please share how you got it working?