LLMChat error loading model: failed to open models/llama/Drararara

error loading model: failed to open models/llama/Drararara_llama-13B-ggml: Permission denied

Open madhack2142 opened this issue 1 year ago • 4 comments

These are the models I tried to load; each gives the same error. Log: when_trying_to_request.txt

May 12 '23 15:05 madhack2142

Same issue here...

May 12 '23 15:05 DielynLandel

Try to put the .bin files directly into the models/llama directory, the script doesn't support walking through folders yet
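
As a rough illustration of that workaround, the sketch below moves any .bin files found in subfolders of models/llama up into models/llama itself. The helper name `flatten_model_dir` is hypothetical and not part of LLMChat, and it assumes the directory layout suggested by the error message above.

```python
import shutil
from pathlib import Path


def flatten_model_dir(model_dir: str) -> list[Path]:
    """Move .bin files from subfolders of model_dir into model_dir itself.

    Hypothetical helper, not part of LLMChat. A file whose name already
    exists at the top level is skipped rather than overwritten.
    """
    root = Path(model_dir)
    if not root.is_dir():
        return []
    moved = []
    # sorted() materializes the search first, so moving files does not disturb it.
    for src in sorted(root.rglob("*.bin")):
        if src.parent == root:
            continue  # already at the top level
        dest = root / src.name
        if dest.exists():
            print(f"skipping {src}: {dest} already exists")
            continue
        shutil.move(str(src), str(dest))
        moved.append(dest)
    return moved


if __name__ == "__main__":
    for path in flatten_model_dir("models/llama"):
        print(f"moved to {path}")
```

If the "Permission denied" error persists after the files sit directly in models/llama, checking the file permissions themselves would be the next thing to try.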

May 13 '23 17:05 hc20k

Hello! Thanks for responding!

We tried several models such as ggml-model-q4_0.bin by Drararara.

After a little more than a minute, a simple request ("Hi!") returns nonsense.

We only checked the text chat. The bot is silent in the voice chat. All of the requirements were installed correctly.

System:

AMD Ryzen 9 5950X
32GB RAM
Nvidia RTX3090 (24GB)

May 14 '23 19:05 DielynLandel

DielynLandel

This may be due to the temperature, frequency penalty, or presence penalty in the config. I find that a higher frequency penalty (1.1) works better for LLaMA models. Personally, I use these settings for LLaMA:

temperature = 0.8
presence_penalty = 0.4
max_tokens = 0
frequency_penalty = 1.1

I've tested it with pygmalion-7b, wizardlm-7b/13b and this works pretty well.
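
To give a feel for why a higher frequency penalty can cut down on repetitive output, here is a minimal sketch of how these two penalties are commonly defined (the OpenAI-style formula: each token's score is lowered by its repeat count times the frequency penalty, plus a flat presence penalty once it has appeared). Whether the backend behind LLMChat applies them in exactly this way is an assumption, and `apply_penalties` is a hypothetical name.

```python
from collections import Counter


def apply_penalties(logits, generated_ids, frequency_penalty=1.1, presence_penalty=0.4):
    """Return a copy of logits with OpenAI-style repetition penalties applied.

    logits: dict mapping token id -> raw score (illustrative, not a real model).
    generated_ids: token ids produced so far.
    """
    counts = Counter(generated_ids)
    adjusted = dict(logits)
    for token_id, count in counts.items():
        if token_id in adjusted:
            # The frequency penalty grows with every repeat; the presence
            # penalty is a flat cost paid once the token has appeared at all.
            adjusted[token_id] -= count * frequency_penalty + presence_penalty
    return adjusted


# Three equally likely tokens; token 1 was already generated three times.
logits = {0: 2.0, 1: 2.0, 2: 2.0}
print(apply_penalties(logits, generated_ids=[1, 1, 1, 2]))
```

With the settings above, a token that has already been emitted three times ends up well below one that has not appeared yet, which discourages loops.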

It also helps LLaMA if you provide a short chat example in your initial prompt, like so:

...your initial prompt

{user_name}: Hello!
{bot_name}: Hi, how's it going today?
{user_name}: Fine, how about you?
{bot_name}: I'm doing well too.
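
For illustration only, this is roughly how such a prompt could be assembled and the placeholders filled in. LLMChat presumably substitutes {user_name} and {bot_name} itself; the use of `str.format` and the name `build_initial_prompt` are assumptions made for this sketch, and the names "Alice" and "Bot" are made up.

```python
INITIAL_PROMPT = "...your initial prompt"  # placeholder, replace with your own text

EXAMPLE_CHAT = (
    "{user_name}: Hello!\n"
    "{bot_name}: Hi, how's it going today?\n"
    "{user_name}: Fine, how about you?\n"
    "{bot_name}: I'm doing well too."
)


def build_initial_prompt(user_name: str, bot_name: str) -> str:
    """Append the example chat to the initial prompt and fill in the names."""
    template = INITIAL_PROMPT + "\n\n" + EXAMPLE_CHAT
    return template.format(user_name=user_name, bot_name=bot_name)


print(build_initial_prompt(user_name="Alice", bot_name="Bot"))
```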

Please let me know if this works out for you. If not, I will be happy to help you out some more.

May 15 '23 17:05 hc20k
