quantization-aware-training topic

List quantization-aware-training repositories

Adventures-in-TensorFlow-Lite

168
Stars
33
Forks
Watchers

This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.

micronet

2.2k
Stars
477
Forks
Watchers

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ar...

TinyNeuralNetwork

716
Stars
117
Forks
Watchers

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

neural-compressor

2.2k
Stars
254
Forks
24
Watchers

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

nncf

834
Stars
211
Forks
Watchers

Neural Network Compression Framework for enhanced OpenVINO™ inference

Sparsebit

321
Stars
39
Forks
Watchers

A model compression and acceleration toolbox based on pytorch.

frostnet

106
Stars
18
Forks
Watchers

FrostNet: Towards Quantization-Aware Network Architecture Search

torch-model-compression

249
Stars
41
Forks
Watchers

针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库

CNN_on_MCU

24
Stars
19
Forks
Watchers

Code for paper 'Multi-Component Optimization and Efficient Deployment of Neural-Networks on Resource-Constrained IoT Hardware'