small-models topic

List small-models repositories

SqueezeLLM

632
Stars
42
Forks
Watchers

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

KVQuant

286
Stars
25
Forks
Watchers

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

openssa

201
Stars
31
Forks
Watchers

OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving