SqueezeAILab

Results 4 repositories owned by SqueezeAILab

SqueezeLLM

632
Stars
42
Forks
Watchers

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

LLMCompiler

1.4k
Stars
104
Forks
Watchers

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

KVQuant

286
Stars
25
Forks
Watchers

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

LLM2LLM

149
Stars
11
Forks
Watchers

[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement