quantization topic
navec
Compact high quality word embeddings for Russian language
brevitas
Brevitas: neural network quantization in PyTorch
finn
Dataflow compiler for QNN inference on FPGAs
sparsezoo
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
qkeras
QKeras: a quantization deep learning library for Tensorflow Keras
aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
awesome-ml-model-compression
Awesome machine learning model compression research papers, tools, and learning material.
sparsify
ML model optimization product to accelerate inference.