stanford_alpaca
Inference error with llama-7b
The response is none, and the generated_token_ids are all 7.
Same error! Have you solved it yet?
I encountered the same error when using alpaca-lora for inference. In the end, the cause was that quantizing the model with bitsandbytes had failed. I wonder if that is the problem for you as well.
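
If it helps anyone confirm they are hitting the same symptom, here is a minimal sketch of a check for output that has collapsed to one repeated token id (such as all 7). The helper name `is_degenerate` and the `threshold` parameter are hypothetical and not part of stanford_alpaca or alpaca-lora. It only detects the symptom; it does not tell you whether quantization is the cause.

```python
from collections import Counter


def is_degenerate(generated_token_ids, threshold=1.0):
    """Return True if one token id makes up at least `threshold` of the output.

    Hypothetical helper: with threshold=1.0 it flags only outputs in which
    every generated id is identical, as reported in this thread.
    """
    ids = list(generated_token_ids)
    if len(ids) < 2:
        # Too short to call a repeated-token collapse.
        return False
    # Count of the single most frequent token id.
    _, count = Counter(ids).most_common(1)[0]
    return count / len(ids) >= threshold


print(is_degenerate([7, 7, 7, 7, 7]))        # True
print(is_degenerate([1, 306, 4966, 29889]))  # False
```

If the check comes back True, one thing worth trying is loading the model without bitsandbytes quantization (for example in half precision, if you have the memory) and comparing the output.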