efficient-model topic
amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
amc-models
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices