chat-with-gpt
Support for ggerganov/llama.cpp
Would be cool if it could support https://github.com/ggerganov/llama.cpp
Is it on your roadmap?
Maybe using https://cocktailpeanut.github.io/dalai/#/
Integrating with the Dalai API should be doable.
How well does Llama perform in a chat style / conversational format?
With https://alpaca-ai-custom5.ngrok.io it seems to perform well.
Alpaca is explicitly trained for instruction-style messages. LLaMA by default isn't ready for conversations (yet) without some clever prompt engineering.
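To illustrate the kind of prompt engineering meant here, below is a rough sketch of flattening a chat history into a single text prompt that a completion-only model can continue. The function name `build_chat_prompt`, the "User:"/"Assistant:" labels, and the preamble sentence are all assumptions made up for this example, not something taken from llama.cpp, Dalai, or chat-with-gpt.

```python
def build_chat_prompt(messages, preamble="A chat between a user and a helpful assistant."):
    """Flatten (role, text) pairs into one transcript-style prompt.

    Hypothetical helper: the labels and preamble are illustrative choices,
    not a format required by any particular model.
    """
    lines = [preamble, ""]
    for role, text in messages:
        speaker = "User" if role == "user" else "Assistant"
        lines.append(f"{speaker}: {text.strip()}")
    # End on an open assistant turn so the model continues as the assistant.
    lines.append("Assistant:")
    return "\n".join(lines)


history = [
    ("user", "Hi"),
    ("assistant", "Hello! How can I help?"),
    ("user", "What is llama.cpp?"),
]
prompt = build_chat_prompt(history)
print(prompt)
```

In practice a stop sequence such as "User:" would probably also be needed so the model doesn't keep writing both sides of the conversation.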
I'd like to implement this via Dalai's API, which now supports Alpaca, but this issue needs to be resolved first:
https://github.com/cocktailpeanut/dalai/issues/51
It's also a bit slow for longer chats. Ideally it would use llama.cpp's interactive mode instead of resending the whole context with each message, but I don't think this is implemented yet?
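A back-of-the-envelope sketch of why resending the whole context gets slow: if every turn re-evaluates the full history, the total number of tokens processed grows roughly quadratically with the number of turns, while a session that keeps its state only evaluates each token once. The function name and the token counts below are made up for illustration, and the model ignores everything except prompt evaluation.

```python
def tokens_processed(turn_lengths):
    """Return (resend_total, incremental_total) for a chat.

    turn_lengths: token count added to the conversation at each turn
    (hypothetical numbers; real counts depend on the tokenizer).
    """
    resend_total = 0
    context = 0
    for n in turn_lengths:
        context += n
        # Resending: the entire context so far is evaluated again this turn.
        resend_total += context
    # Keeping state: each token is evaluated exactly once.
    incremental_total = context
    return resend_total, incremental_total


resend, incremental = tokens_processed([50] * 10)
print(resend, incremental)  # 2750 500
```

So for ten turns of 50 tokens each, resending evaluates 2750 tokens in total versus 500 when state is kept, and the gap widens as the chat gets longer.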