pyllama
pyllama copied to clipboard
NVMLError_NoPermission: Insufficient Permissions
when use python quant_infer.py --wbits 2 --load pyllama-7B8b.pt --text "..." --max_length 24
, it raises error
NVMLError_NoPermission: Insufficient Permissions