ptq topics

flexible-yolov5

658

Stars

120

Forks

Watchers

More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam，dcn and so on), and tensorrt

Bobo-y

backbone

cbam

dcnv2

efficientnet

brevitas

1.1k

Stars

186

Forks

Watchers

Brevitas: neural network quantization in PyTorch

Xilinx

brevitas

fpga

hardware-acceleration

image-classification

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers adva...

sony

deep-learning

deep-neural-networks

machine-learning

network-compression

TensorRT_API

17

Stars

4

Forks

Watchers

Deep Learning Model Optimization Using by TensorRT API, window

yester31

cuda

detr

ptq

pytorch

llmc

308

Stars

32

Forks

Watchers

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

ModelTC

benchmark

deployment

evaluation

large-language-models

OutEffHop

18

Stars

2

Forks

Watchers

[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

MAGICS-LAB

attention

attention-mechanism

hopfield-neural-network

icml-2024

MI-optimize

18

Stars

4

Forks

Watchers

mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techniqu...

TsingmaoAI

benchmark

inference

large-language-model

llm

ptq topic

flexible-yolov5

brevitas

model_optimization

TensorRT_API

llmc

OutEffHop

MI-optimize