small-models topic

Repositories tagged small-models

SqueezeLLM

Stars: 577
Forks: 37

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

KVQuant

Stars: 212
Forks: 16

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

openssa

Stars: 149
Forks: 24

OpenSSA: Small Specialist Agents—Enabling Efficient, Domain-Specific Planning + Reasoning for AI