mrciffa
Results: 3 issues of mrciffa
As suggested in the repo, I tried to quantize the StarChat model, since BigCode models should be supported. ```python from transformers import AutoTokenizer from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig import logging...
I tried to use vLLM with my model fine-tuned from StarCoder, but it does not seem to be supported by the official package (?), even though the README.md says it is. ```...