
How to run a llama.cpp server with the FastChat API


I have set up the server, but it outputs only a few words, as if blocked, and it runs as a single process that cannot respond quickly. It only loads the model and runs when a request arrives.
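For reference, a minimal sketch of one way to wire llama.cpp-style models into FastChat's OpenAI-compatible API. The three `fastchat.serve` modules below are FastChat's own; the model path, host, and port values are placeholders, and serving GGML/llama.cpp weights through the standard model worker is an assumption that may not match the setup in this issue:

```shell
# Sketch only: module names are FastChat's; paths and ports are placeholders.

# 1. Start the controller, which tracks registered model workers.
python3 -m fastchat.serve.controller --host 0.0.0.0 --port 21001

# 2. Start a model worker (assumes an HF-format Vicuna checkpoint;
#    llama.cpp/GGML weights may require a different worker).
python3 -m fastchat.serve.model_worker \
    --model-path /path/to/vicuna-7b \
    --controller-address http://localhost:21001

# 3. Expose an OpenAI-compatible REST API on top of the workers.
python3 -m fastchat.serve.openai_api_server --host 0.0.0.0 --port 8000
```

Note that the model worker loads the model once at startup and keeps it resident; if the model is only loaded when a request comes in, the worker is likely not staying registered with the controller.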

xx-zhang avatar Jun 27 '23 01:06 xx-zhang

How did you set up the server?

fredi-python avatar Jun 27 '23 23:06 fredi-python