llama2-webui
Very slow generation
I am running this on a Mac M1 with 16 GB RAM, using app.py for simple text generation. Running llama.cpp directly from the terminal is much faster, but when I go through the backend via app.py, generation is very slow. Any ideas?
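
For comparison, this is roughly how the backend can be called directly, a minimal sketch assuming the webui wraps llama-cpp-python under the hood; the model path and parameter values here are placeholders, not the repo's actual defaults:

```python
# Minimal direct llama-cpp-python call for comparison with the app.py path.
# Assumes llama-cpp-python is installed; model_path is a placeholder.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-2-7b-chat.Q4_0.gguf",  # placeholder path
    n_gpu_layers=1,   # offload layers to Metal on Apple Silicon; 0 keeps everything on CPU
    n_ctx=2048,       # context window size
    verbose=True,     # print load/eval timings so the two paths can be compared
)

output = llm("Q: What is the capital of France? A:", max_tokens=32)
print(output["choices"][0]["text"])
```

If a direct call like this is fast but app.py is slow, the difference is presumably in how app.py configures the backend (e.g. GPU offload disabled) rather than in llama.cpp itself.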