llm-inference topic
cappr
Completion After Prompt Probability. Make your LLM make a choice
LLMUnity
Create characters in Unity with LLMs!
ht
ht - a shell command that answers your questions about shell commands
Prompt-Highlighter
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
distributed-llama
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) FineTuning
embedding_studio
Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
LLMtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
LeanCopilot
LLMs as Copilots for Theorem Proving in Lean