speculative-decoding topic

List speculative-decoding repositories

intel-extension-for-transformers

2.1k
Stars
209
Forks
Watchers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

BigLittleDecoder

85
Stars
10
Forks
Watchers

[NeurIPS'23] Speculative Decoding with Big Little Decoder

SpecDec

31
Stars
0
Forks
Watchers

Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)

aphrodite-engine

1.0k
Stars
112
Forks
Watchers

Large-scale LLM inference engine

Sequoia

305
Stars
31
Forks
Watchers

scalable and robust tree-based speculative decoding algorithm

TriForce

209
Stars
12
Forks
Watchers

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

EAGLE

780
Stars
79
Forks
Watchers

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

speculative_decoding.c

16
Stars
2
Forks
Watchers

minimal C implementation of speculative decoding based on llama2.c

REST

163
Stars
10
Forks
Watchers

REST: Retrieval-Based Speculative Decoding, NAACL 2024

Speculative-Decoding

18
Stars
0
Forks
Watchers

Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.