llama2gptq
llama2gptq copied to clipboard
Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quantization.
This issue lists Renovate updates and detected dependencies. Read the [Dependency Dashboard](https://docs.renovatebot.com/key-concepts/dashboard/) docs to learn more. ## Open These updates have all been created already. Click a checkbox below to...
https://huggingface.co/docs/transformers/internal/generation_utils#transformers.TextStreamer
https://github.com/zahidkhawaja/langchain-chat-nextjs