ptq topic

List ptq repositories

flexible-yolov5

658
Stars
120
Forks
Watchers

More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt

brevitas

1.1k
Stars
186
Forks
Watchers

Brevitas: neural network quantization in PyTorch

model_optimization

291
Stars
47
Forks
Watchers

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers adva...

TensorRT_API

17
Stars
4
Forks
Watchers

Deep Learning Model Optimization Using by TensorRT API, window

LightCompress

649
Stars
64
Forks
649
Watchers

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

OutEffHop

21
Stars
4
Forks
21
Watchers

[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

MI-optimize

18
Stars
4
Forks
Watchers

mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniqu...

GERM

17
Stars
2
Forks
17
Watchers

[ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.