kv-cache topic
List
kv-cache repositories
godis
3.4k
Stars
553
Forks
Watchers
A Golang implemented Redis Server and Cluster. Go 语言实现的 Redis 服务器和分布式集群
cappr
68
Stars
2
Forks
Watchers
Completion After Prompt Probability. Make your LLM make a choice
H2O
295
Stars
25
Forks
Watchers
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
pytorch-llama-notes
43
Stars
5
Forks
Watchers
Notes about LLaMA 2 model
EasyKV
56
Stars
4
Forks
Watchers
Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)
LLaMA2
48
Stars
7
Forks
Watchers
This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT) variant. The implementation focuses on the model architecture...