ModelTC

Results 4 repositories owned by ModelTC

lightllm

2.3k
Stars
191
Forks
Watchers

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

llmc

308
Stars
32
Forks
Watchers

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

QLLM

33
Stars
2
Forks
Watchers

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"

TFMQ-DM

53
Stars
3
Forks
Watchers

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".