text-generation-webui icon indicating copy to clipboard operation
text-generation-webui copied to clipboard

Not find the models...

Open luussta opened this issue 1 year ago • 9 comments

Describe the bug

Y use the commands: cd text-generation-webui call python server.py --chat --pre_layer 31 --wbits 4 --groupsize 128 --model gpt4-x-alpaca-13b-native-4bit-128g

And the error name: Could not find the quantized model in .pt or .safetensors format, exiting...

Is there an existing issue for this?

  • [X] I have searched the existing issues

Reproduction

cd text-generation-webui call python server.py --chat --pre_layer 31 --wbits 4 --groupsize 128 --model gpt4-x-alpaca-13b-native-4bit-128g

Screenshot

Could not find the quantized model in .pt or .safetensors format, exiting...

Logs

Could not find the quantized model in .pt or .safetensors format, exiting...

System Info

Gpu: Nvidia 3050
Ram: 8gb

luussta avatar Apr 21 '23 21:04 luussta

How does your models folder look like?

myluki2000 avatar Apr 21 '23 22:04 myluki2000

image

luussta avatar Apr 21 '23 22:04 luussta

cd text-generation-webui call python server.py --chat --pre_layer 31 --wbits 4 --groupsize 128 --model gpt4-x-alpaca-13b-native-4bit-128g

Model name must match folder name:

--model anon8231489123_gpt4-x-alpaca-13b-native-4bit-128g

Juqowel avatar Apr 22 '23 03:04 Juqowel

Putting that command, it finds the model but says: DefaultCPUAllocator, any idea for how to fix it?

luussta avatar Apr 22 '23 15:04 luussta

4-bit models get loaded into RAM before being sent to VRAM. 16GB RAM - minimum for 13b-4bit model. (10-11GB free RAM).

Juqowel avatar Apr 22 '23 17:04 Juqowel

What can i do?

luussta avatar Apr 22 '23 17:04 luussta

Anyone has a solution?

luussta avatar Apr 22 '23 17:04 luussta

get a better computer or run the AI using Google Colab.

myluki2000 avatar Apr 23 '23 10:04 myluki2000

get a better computer or run the AI using Google Colab.

I will use google colab. 😢

luussta avatar Apr 23 '23 13:04 luussta

This issue has been closed due to inactivity for 30 days. If you believe it is still relevant, please leave a comment below.

github-actions[bot] avatar May 23 '23 23:05 github-actions[bot]