GPTQ-for-LLaMa
GPTQ-for-LLaMa copied to clipboard
probability tensor contains either `inf`, `nan` or element < 0
CUDA_VISIBLE_DEVICES=0 python llama_inference.py decapoda-research/llama-7b-hf --wbits 4 --load llama7b-4bit.pt --text "this is llama"
Loading model ...
Done.
Traceback (most recent call last):
File "llama_inference.py", line 115, in inf, nan or element < 0