quantization topic

List quantization repositories

navec

170
Stars
16
Forks
Watchers

Compact high quality word embeddings for Russian language

brevitas

1.1k
Stars
186
Forks
Watchers

Brevitas: neural network quantization in PyTorch

finn

674
Stars
212
Forks
Watchers

Dataflow compiler for QNN inference on FPGAs

sparsezoo

361
Stars
23
Forks
Watchers

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

neural-compressor

2.2k
Stars
254
Forks
Watchers

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

qkeras

527
Stars
101
Forks
Watchers

QKeras: a quantization deep learning library for Tensorflow Keras

aimet

2.0k
Stars
353
Forks
Watchers

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

nncf

834
Stars
211
Forks
Watchers

Neural Network Compression Framework for enhanced OpenVINO™ inference

awesome-ml-model-compression

448
Stars
58
Forks
Watchers

Awesome machine learning model compression research papers, tools, and learning material.

sparsify

315
Stars
27
Forks
Watchers

ML model optimization product to accelerate inference.