model-compression topic

List model-compression repositories

SVD-LLM

87
Stars
7
Forks
Watchers

Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"

logit-standardization-KD

305
Stars
12
Forks
Watchers

[CVPR 2024 Highlight] Logit Standardization in Knowledge Distillation

LumiNet

16
Stars
2
Forks
Watchers

The official implementation of LumiNet: The Bright Side of Perceptual Knowledge Distillation https://arxiv.org/abs/2310.03669

Pruning-LLMs

94
Stars
1
Forks
Watchers

The framework to prune LLMs to any size and any config.

OpenBA-v2

18
Stars
0
Forks
Watchers

OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-15B.

Awesome-Diffusion-Distillation

15
Stars
0
Forks
Watchers

A list of papers, docs, codes about diffusion distillation.This repo collects various distillation methods for the Diffusion model. Welcome to PR the works (papers, repositories) missed by the repo.

picollm

273
Stars
15
Forks
273
Watchers

On-device LLM Inference Powered by X-Bit Quantization

MoA

80
Stars
5
Forks
Watchers

The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>

Awesome-Tensor-Decomposition

46
Stars
4
Forks
Watchers

😎 A curated list of tensor decomposition resources for model compression.

Awesome-Token-level-Model-Compression

183
Stars
7
Forks
183
Watchers

📚 Collection of token-level model compression resources.