efficient-model topic

List efficient-model repositories

SDPoint

18
Stars
4
Forks
Watchers

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

ABCNet

29
Stars
3
Forks
Watchers

The semantic segmentation of remote sensing images

NeurIPSCD2019, MicroNet Challenge hosted by Google, Deepmind Researcher, "Efficient Model for Image Classification With Regularization Tricks".

KVQuant

286
Stars
25
Forks
Watchers

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

owq

60
Stars
7
Forks
Watchers

Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".

SVD-LLM

87
Stars
7
Forks
Watchers

Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"