small-models topic
small-models repositories
SqueezeLLM
632 stars, 42 forks
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
KVQuant
286 stars, 25 forks
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
openssa
201 stars, 31 forks
OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving