llama2gptq

Published 20 hours ago

Chat with LLaMa 2 and get responses backed by reference documents retrieved from a vector database. The model runs locally using GPTQ 4-bit quantization.

Issues

Results 5 llama2gptq issues

deps: renovate dependency dashboard

This issue lists Renovate updates and detected dependencies. Read the [Dependency Dashboard](https://docs.renovatebot.com/key-concepts/dashboard/) docs to learn more. ## Open These updates have all been created already. Click a checkbox below to...

renovate[bot]

feat: text streaming using streamer or criterion

https://huggingface.co/docs/transformers/internal/generation_utils#transformers.TextStreamer

seonglae

model: auto gptq quantization from knowledge distillation

seonglae

data: URL-based new DB generation of texonom

1

seonglae

feat: web ui support using fastapi & next.js

1

https://github.com/zahidkhawaja/langchain-chat-nextjs

seonglae

About

chatbot

question-answering

transformers

cuda

gpt

quantization

model-quantization

rye

chatai

streamlit-chat

chatgpt

langchain

llama2

llama-2

Stars: 30

Forks: 0

Watchers

Owner: seonglae
