mrciffa

Results 3 issues of mrciffa

As proposed from the the repo, i tried to quantize the starchat model because bigcode should be supported ```python from transformers import AutoTokenizer from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig import logging...

I tried to use vllm on my finetuned model from starcoder, but its seems not supported from the official package (?) In the README.md is said to be supported. ```...