small-models topic
List
small-models repositories
SqueezeLLM
577
Stars
37
Forks
Watchers
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
KVQuant
212
Stars
16
Forks
Watchers
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
openssa
149
Stars
24
Forks
Watchers
OpenSSA: Small Specialist Agents—Enabling Efficient, Domain-Specific Planning + Reasoning for AI