token-pruning topic
Moonlit
This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.
LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models, including LLMs, VLMs, and video generation models.
vid-TLDR
Official implementation of the CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
HoliTom
[NeurIPS'25] HoliTom: Holistic Token Merging for Fast Video Large Language Models
twigvlm
Implementation of the ICCV 2025 paper "Growing a Twig to Accelerate Large Vision-Language Models".
Awesome-Token-level-Model-Compression
📚 Collection of token-level model compression resources.