quantization topics

navec

170

Stars

16

Forks

Watchers

Compact high quality word embeddings for Russian language

natasha

embeddings

glove

nlp

python

brevitas

1.1k

Stars

186

Forks

Watchers

Brevitas: neural network quantization in PyTorch

Xilinx

brevitas

fpga

hardware-acceleration

image-classification

finn

674

Stars

212

Forks

Watchers

Dataflow compiler for QNN inference on FPGAs

Xilinx

compiler

dataflow

fpga

neural-network

sparsezoo

361

Stars

23

Forks

Watchers

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

neuralmagic

computer-vision

deep-learning-algorithms

deep-learning-models

mobilenet

neural-compressor

2.2k

Stars

254

Forks

Watchers

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

intel

auto-tuning

deep-learning

knowledge-distillation

low-precision

qkeras

527

Stars

101

Forks

Watchers

QKeras: a quantization deep learning library for Tensorflow Keras

google

accelerator

asic-design

deep-learning

fpga

aimet

2.0k

Stars

353

Forks

Watchers

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

quic

auto-ml

compression

deep-learning

deep-neural-networks

nncf

834

Stars

211

Forks

Watchers

Neural Network Compression Framework for enhanced OpenVINO™ inference

openvinotoolkit

bert

classification

compression

hawq

awesome-ml-model-compression

448

Stars

58

Forks

Watchers

Awesome machine learning model compression research papers, tools, and learning material.

cedrickchee

awesome-list

machine-learning

model-compression

neural-networks

sparsify

315

Stars

27

Forks

Watchers

ML model optimization product to accelerate inference.

neuralmagic

automl

computer-vision

deep-learning-accelerator

image-classification