ptq topic
flexible-yolov5
More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt
brevitas
Brevitas: neural network quantization in PyTorch
model_optimization
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers adva...
TensorRT_API
Deep Learning Model Optimization Using by TensorRT API, window
LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.
OutEffHop
[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
MI-optimize
mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniqu...
GERM
[ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.