lamacpp topic
List
lamacpp repositories
fastLLaMa
408
Stars
27
Forks
Watchers
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backend.
rag-chatbot
148
Stars
29
Forks
Watchers
RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.