model-quantization topic

List model-quantization repositories

Adventures-in-TensorFlow-Lite

168
Stars
33
Forks
Watchers

This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.

awesome-model-quantization

1.7k
Stars
200
Forks
Watchers

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (pape...

psychopathology-fer-assistant

71
Stars
25
Forks
Watchers

[WINNER! 🏆] Psychopathology FER Assistant. Because mental health matters. My project submission for #TFWorld TF 2.0 Challenge at Devpost.

BiBench

47
Stars
3
Forks
Watchers

This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarization.

QuantSR

33
Stars
2
Forks
Watchers

This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.

awesome-efficient-aigc

109
Stars
10
Forks
Watchers

A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including language and vision, we are continuously improving the project. Welcom...

OFQ

26
Stars
0
Forks
Watchers

The official implementation of the ICML 2023 paper OFQ-ViT

llama2gptq

30
Stars
0
Forks
Watchers

Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quantization.

Awesome-Efficient-LLM

861
Stars
65
Forks
Watchers

A curated list for Efficient Large Language Models

inferflow

227
Stars
21
Forks
Watchers

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).