Hamed Emine
Fixed GGUF models here: https://huggingface.co/AI-Engine/Meta-Llama-3-8B-Instruct-GGUF/tree/main
> > For the API I had to manually insert these fields in completions.py: 'skip_special_tokens': False, 'custom_stopping_strings': '""'
> > as the other side doesn't insert those fields. I think the...
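
For anyone who would rather not edit completions.py by hand, here is a minimal sketch of the same idea: fill in the two fields only when the client left them out. The helper name `with_missing_fields` and the constant `ASSUMED_DEFAULTS` are hypothetical and not part of the project; the values are copied from the quote above and are not verified against the server code.

```python
from typing import Any, Dict

# Values taken verbatim from the quoted comment; treat them as assumptions.
ASSUMED_DEFAULTS: Dict[str, Any] = {
    "skip_special_tokens": False,
    "custom_stopping_strings": '""',
}


def with_missing_fields(body: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of the request body with the assumed defaults filled in."""
    merged = dict(ASSUMED_DEFAULTS)
    merged.update(body)  # fields the client did send take precedence
    return merged
```

Calling `with_missing_fields({"prompt": "hi"})` returns the prompt plus both fields, while a body that already sets `skip_special_tokens` keeps its own value.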
I have the same issue here
Hello, I was able to resolve this by using "ExLlamav2_HF" as the loader instead of "GPTQ-for-LLaMa". Make sure to click Save Settings so that it uses that loader the next time it launches.
Edit CMD_FLAGS.txt in the root folder and add this line:

--listen --listen-port=1234

(Change 1234 to the port of your choosing.)

Here is an example of CMD_FLAGS.txt:

```
# Only used by...
```
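
After restarting with the new flags, a quick way to confirm that something is listening on the chosen port is a plain TCP connection test. This is only a sketch using the Python standard library; the function name `port_is_open` is made up for illustration, and 1234 is just the example port from above.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within timeout."""
    try:
        # create_connection raises OSError (refused, timed out, unreachable) on failure
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `port_is_open("127.0.0.1", 1234)` checks the local machine. To check from another computer, pass the server's LAN address instead. A `False` result can also mean a firewall is blocking the port, not only that the server is down.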