efficient-inference topic
graphless-neural-networks
[ICLR 2022] Code for Graph-less Neural Networks: Teaching Old MLPs New Tricks via Distillation (GLNN)
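A minimal sketch of the distillation idea behind GLNN, assuming teacher logits have already been precomputed by a GNN: an MLP is trained on raw node features against a weighted sum of the hard-label loss and a KL term toward the teacher's soft labels, so inference needs no graph at all. The tensor names and sizes here are illustrative placeholders, not the repo's API.

```python
import torch
import torch.nn.functional as F

feats = torch.randn(100, 16)            # node features (no graph needed)
labels = torch.randint(0, 4, (100,))    # ground-truth classes
teacher_logits = torch.randn(100, 4)    # soft labels precomputed by a GNN teacher

mlp = torch.nn.Sequential(
    torch.nn.Linear(16, 64), torch.nn.ReLU(), torch.nn.Linear(64, 4))
opt = torch.optim.Adam(mlp.parameters(), lr=1e-3)

for _ in range(200):
    logits = mlp(feats)
    # weighted sum of the label loss and KL to the teacher's soft labels
    loss = 0.5 * F.cross_entropy(logits, labels) + \
           0.5 * F.kl_div(F.log_softmax(logits, dim=1),
                          F.softmax(teacher_logits, dim=1),
                          reduction="batchmean")
    opt.zero_grad(); loss.backward(); opt.step()
```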
SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
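A rough NumPy sketch of the dense-and-sparse decomposition named in the title, under the simplifying assumption of uniform quantization: the few largest-magnitude outlier weights are pulled into a sparse full-precision matrix, and only the well-behaved dense remainder is quantized to low bit-width. SqueezeLLM itself pairs this with sensitivity-based non-uniform quantization; the function names below are hypothetical.

```python
import numpy as np

def dense_and_sparse(W, outlier_frac=0.005, bits=4):
    """Split W into sparse fp outliers plus a low-bit dense remainder."""
    thresh = np.quantile(np.abs(W), 1.0 - outlier_frac)
    sparse = np.where(np.abs(W) >= thresh, W, 0.0)  # outliers kept in full precision
    dense = W - sparse
    qmax = 2 ** (bits - 1) - 1                      # e.g. 7 for 4-bit symmetric
    scale = np.abs(dense).max() / qmax
    q = np.round(dense / scale).astype(np.int8)     # quantized dense part
    return q, scale, sparse

def dequantize(q, scale, sparse):
    return q.astype(np.float32) * scale + sparse

W = np.random.randn(256, 256).astype(np.float32)
q, scale, sparse = dense_and_sparse(W)
print("max reconstruction error:", np.abs(dequantize(q, scale, sparse) - W).max())
```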
DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
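A rough sketch of the caching pattern DeepCache exploits: the deep, expensive U-Net blocks change slowly between adjacent diffusion timesteps, so their output can be recomputed only every few steps and reused in between, while the cheap shallow path still runs every step. `shallow`, `deep`, and `head` are hypothetical stand-ins for a real U-Net's block partition, not the repo's code.

```python
import torch

shallow = torch.nn.Linear(32, 32)   # cheap per-step blocks
deep = torch.nn.Linear(32, 32)      # expensive blocks, candidates for caching
head = torch.nn.Linear(64, 32)      # combines skip features with deep features

def sample(x, steps=50, cache_interval=5):
    cached_deep = None
    for t in range(steps):
        h = shallow(x)                        # always run the cheap path
        if cached_deep is None or t % cache_interval == 0:
            cached_deep = deep(h)             # full forward only on refresh steps
        out = head(torch.cat([h, cached_deep], dim=-1))  # skip + cached deep features
        x = x - 0.01 * out                    # stand-in for a denoising update
    return x

print(sample(torch.randn(1, 32)).shape)
```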
LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
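A minimal sketch of the parallel function-calling pattern LLMCompiler targets: tool calls that a plan identifies as independent are dispatched concurrently instead of one at a time. The tool and queries here are hypothetical; the real system derives the call dependency graph from an LLM planner.

```python
import asyncio

async def search(query):                # hypothetical tool call
    await asyncio.sleep(1.0)            # simulated API latency
    return f"results for {query!r}"

async def main():
    # two independent calls from the plan run in parallel (~1s total, not ~2s)
    a, b = await asyncio.gather(search("flight prices"),
                                search("hotel prices"))
    print(a, b, sep="\n")

asyncio.run(main())
```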
BigLittleDecoder
[NeurIPS 2023] Speculative Decoding with Big Little Decoder
speculative-decoding
Explorations of some recent techniques for speculative decoding
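A minimal NumPy sketch of the draft-then-verify loop common to these techniques (and to the Big Little Decoder entry above): a cheap draft model proposes several tokens, and the target model accepts each with probability min(1, p/q), resampling from the residual distribution at the first rejection so the output distribution stays exact. The toy model functions are hypothetical stand-ins, and the bonus token drawn when every draft is accepted is omitted for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB = 8

def draft_probs(ctx):   # hypothetical cheap draft model
    p = rng.random(VOCAB); return p / p.sum()

def target_probs(ctx):  # hypothetical expensive target model
    p = rng.random(VOCAB); return p / p.sum()

def speculative_step(context, k=4):
    # 1) draft k tokens cheaply, remembering each proposal distribution
    tokens, proposals, ctx = [], [], list(context)
    for _ in range(k):
        q = draft_probs(ctx)
        t = int(rng.choice(VOCAB, p=q))
        tokens.append(t); proposals.append(q); ctx.append(t)

    # 2) verify with the target model
    out, ctx = [], list(context)
    for t, q in zip(tokens, proposals):
        p = target_probs(ctx)
        if rng.random() < min(1.0, p[t] / q[t]):
            out.append(t); ctx.append(t)        # draft token accepted
        else:
            residual = np.maximum(p - q, 0.0)   # resample from the residual
            out.append(int(rng.choice(VOCAB, p=residual / residual.sum())))
            break
    return out

print(speculative_step([1, 2, 3]))
```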
lzu
[CVPR 2023] Code for Learning to Zoom and Unzoom
TinyML-Benchmark-NNs-on-MCUs
Code for the WF-IoT paper "TinyML Benchmark: Executing Fully Connected Neural Networks on Commodity Microcontrollers"
triple-wins
[ICLR 2020] "Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference"
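A minimal sketch of the input-adaptive inference mechanism the paper builds on, assuming a toy multi-exit network: intermediate classifiers let confident ("easy") inputs exit early, so depth, and therefore compute, is spent only on hard inputs. The block sizes and confidence threshold are illustrative, not the paper's RDI-Net architecture.

```python
import torch
import torch.nn.functional as F

blocks = torch.nn.ModuleList(
    [torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU())
     for _ in range(3)])
exits = torch.nn.ModuleList([torch.nn.Linear(16, 4) for _ in range(3)])

@torch.no_grad()
def adaptive_forward(x, threshold=0.9):
    h = x
    for i, (block, clf) in enumerate(zip(blocks, exits)):
        h = block(h)
        probs = F.softmax(clf(h), dim=-1)
        conf, pred = probs.max(dim=-1)
        if conf.item() >= threshold or i == len(blocks) - 1:
            return pred.item(), i   # prediction and the exit that produced it

print(adaptive_forward(torch.randn(1, 16)))
```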
LightGaussian
[NeurIPS 2024 Spotlight] "LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
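A rough sketch of the pruning step behind this kind of compression, assuming per-Gaussian opacities and scales are available: rank Gaussians by a significance proxy and keep only the most important ones. The score below (opacity times a volume proxy) is a simplification of the paper's global-significance criterion, which it further combines with SH distillation and vector quantization to reach the reported 15x reduction.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
opacity = rng.random(n)           # per-Gaussian opacity
scale = rng.random((n, 3))        # per-Gaussian axis scales

significance = opacity * scale.prod(axis=1)    # crude importance proxy
keep = np.argsort(significance)[-(n // 3):]    # keep the top third (~3x fewer)

print(f"kept {keep.size} of {n} Gaussians")
```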