ptq topic

List ptq repositories

flexible-yolov5

658
Stars
120
Forks
Watchers

More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt

brevitas

1.1k
Stars
186
Forks
Watchers

Brevitas: neural network quantization in PyTorch

model_optimization

291
Stars
47
Forks
Watchers

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers adva...

TensorRT_API

17
Stars
4
Forks
Watchers

Deep Learning Model Optimization Using by TensorRT API, window

llmc

308
Stars
32
Forks
Watchers

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

OutEffHop

18
Stars
2
Forks
Watchers

[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

MI-optimize

18
Stars
4
Forks
Watchers

mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniqu...