efficient-model topics

amc

419

Stars

108

Forks

Watchers

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

mit-han-lab

automl

automl-for-compression

channel-pruning

efficient-model

nn-Meter

321

Stars

56

Forks

Watchers

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

microsoft

deep-learning

deep-neural-networks

edge-ai

edge-computing

temporal-shift-module

2.0k

Stars

418

Forks

Watchers

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

mit-han-lab

acceleration

efficient-model

low-latency

nvidia-jetson-nano

proxylessnas

1.4k

Stars

282

Forks

Watchers

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

mit-han-lab

acceleration

automl

efficient-model

hardware-aware

hardware-aware-transformers

323

Stars

48

Forks

Watchers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

mit-han-lab

efficient-model

hardware-aware

machine-translation

natural-language-processing

once-for-all

1.8k

Stars

332

Forks

Watchers

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

mit-han-lab

acceleration

automl

edge-ai

efficient-model

haq

354

Stars

84

Forks

Watchers

[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision

mit-han-lab

automl

efficient-model

mixed-precision

quantization

ZeroQ

270

Stars

54

Forks

Watchers

[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework

amirgholami

compression

efficient-model

efficient-neural-networks

quantization

I-BERT

212

Stars

30

Forks

Watchers

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

kssteven418

bert

efficient-model

efficient-neural-networks

model-compression

amc-models

163

Stars

27

Forks

Watchers

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

mit-han-lab

automl

efficient-model

model-compression

on-device-ai