pruning-algorithms topic
sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
torchprune
A research library for pytorch-based neural network pruning, compression, and more.
model_optimizer_tf
Model optimizer used in Adlik.
LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
FLAP
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models