model-quantization topic

List model-quantization repositories

Adventures-in-TensorFlow-Lite

168
Stars
33
Forks
Watchers

This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.

awesome-model-quantization

1.7k
Stars
200
Forks
Watchers

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (pape...

psychopathology-fer-assistant

71
Stars
25
Forks
Watchers

[WINNER! 🏆] Psychopathology FER Assistant. Because mental health matters. My project submission for #TFWorld TF 2.0 Challenge at Devpost.

BiBench

54
Stars
4
Forks
Watchers

[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarization.

QuantSR

33
Stars
2
Forks
Watchers

This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.

awesome-efficient-aigc

142
Stars
12
Forks
Watchers

A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including language and vision, we are continuously improving the project. Welcom...

OFQ

27
Stars
0
Forks
Watchers

The official implementation of the ICML 2023 paper OFQ-ViT

llama2gptq

30
Stars
0
Forks
Watchers

Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quantization.

Awesome-Efficient-LLM

1.2k
Stars
85
Forks
Watchers

A curated list for Efficient Large Language Models

inferflow

235
Stars
24
Forks
Watchers

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).