ModelTC

Results 4 repositories owned by ModelTC

lightllm

2.3k
Stars
191
Forks
Watchers

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

LightCompress

625
Stars
62
Forks
625
Watchers

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

QLLM

33
Stars
2
Forks
Watchers

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"

TFMQ-DM

53
Stars
3
Forks
Watchers

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".