model-compression topic
MicroNet_OSI-AI
(NeurIPS 2019 MicroNet Challenge, 3rd place) Open-source code for "SIPA: A simple framework for efficient networks"
research-paper-summaries
A directory of summaries of interesting research papers in the field of deep learning
laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
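The core operation behind layer-selective rank reduction is replacing a chosen weight matrix with a low-rank approximation. A minimal sketch of that step using truncated SVD (the function name `rank_reduce` is illustrative, not from the repo; layer selection, which the paper is actually about, is omitted):

```python
import numpy as np

def rank_reduce(weight: np.ndarray, rank: int) -> np.ndarray:
    """Return the best rank-`rank` approximation of `weight` (truncated SVD)."""
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    # Keep only the top `rank` singular directions
    return (u[:, :rank] * s[:rank]) @ vt[:rank]

# Example: reduce a 64x64 weight matrix to rank 8
rng = np.random.default_rng(0)
w = rng.standard_normal((64, 64))
w_low = rank_reduce(w, rank=8)
```

By Eckart–Young, this is the closest rank-8 matrix to `w` in Frobenius norm; the interesting empirical claim of the paper is that applying such reductions to well-chosen layers can *improve* reasoning accuracy, not just preserve it.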
Multistage_Pruning
Cheng-Hao Tu, Jia-Hong Lee, Yi-Ming Chan and Chu-Song Chen, "Pruning Depthwise Separable Convolutions for MobileNet Compression," International Joint Conference on Neural Networks, IJCNN 2020, July 20...
QuantEase
QuantEase, a layer-wise quantization framework, frames the problem as discrete-structured non-convex optimization. Our work leverages Coordinate Descent techniques, offering high-quality solutions wit...
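To make "layer-wise quantization via coordinate descent" concrete, here is a minimal sketch of the general idea (not the actual QuantEase algorithm): minimize the layer output error `||Xw - Xq||²` over quantized weights `q` restricted to a grid, updating one coordinate at a time. All names and the 4-bit grid are assumptions for illustration:

```python
import numpy as np

def quantize_column(X, w, grid, n_iters=3):
    """Quantize one weight column by coordinate descent on ||X @ w - X @ q||^2,
    with each entry of q restricted to the values in `grid`."""
    # Initialize with round-to-nearest quantization
    q = grid[np.abs(grid[None, :] - w[:, None]).argmin(axis=1)]
    target = X @ w
    for _ in range(n_iters):
        for j in range(len(w)):
            # Residual with coordinate j's contribution removed
            r = target - X @ q + X[:, j] * q[j]
            # Unconstrained optimum for q_j, then snap to the grid
            opt = (X[:, j] @ r) / (X[:, j] @ X[:, j])
            q[j] = grid[np.abs(grid - opt).argmin()]
    return q

# Example: 4-bit uniform grid (assumed), calibration data X
rng = np.random.default_rng(0)
X = rng.standard_normal((128, 16))
w = rng.standard_normal(16)
grid = np.linspace(-2, 2, 16)
q = quantize_column(X, w, grid)
```

Each coordinate update cannot increase the objective, so the result is never worse than plain round-to-nearest on the calibration data; that monotone-improvement property is what makes coordinate descent attractive for this discrete, non-convex problem.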
task-aware-distillation
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
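For context, KV-cache quantization compresses the cached key/value tensors that grow linearly with context length. The sketch below shows only a plain per-channel uniform quantizer as a baseline, not KVQuant's actual method (which goes further, e.g. handling outliers and choosing quantization axes per tensor type); all names are illustrative:

```python
import numpy as np

def quantize_kv(cache: np.ndarray, bits: int = 4):
    """Per-channel asymmetric uniform quantization of a KV-cache slice
    shaped (tokens, channels). Returns integer codes plus scale and offset."""
    lo = cache.min(axis=0, keepdims=True)
    hi = cache.max(axis=0, keepdims=True)
    scale = (hi - lo) / (2**bits - 1)
    scale = np.where(scale == 0, 1.0, scale)  # guard constant channels
    codes = np.round((cache - lo) / scale).astype(np.uint8)
    return codes, scale, lo

def dequantize_kv(codes, scale, lo):
    return codes * scale + lo

# Example: quantize a small cache slice to 4 bits
rng = np.random.default_rng(0)
cache = rng.standard_normal((32, 8))
codes, scale, lo = quantize_kv(cache, bits=4)
deq = dequantize_kv(codes, scale, lo)
```

At 4 bits this cuts cache memory roughly 4x versus fp16; the per-channel rounding error is bounded by half a quantization step.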
CPSCA
Code for paper "Channel Pruning Guided by Spatial and Channel Attention for DNNs in Intelligent Edge Computing"
awesome-compression
A beginner's tutorial on model compression
Pruning-Deep-Neural-Networks-from-a-Sparsity-Perspective
[ICLR 2023] Pruning Deep Neural Networks from a Sparsity Perspective