kv-cache topic

List kv-cache repositories

godis

3.4k
Stars
553
Forks

A Redis server and distributed cluster implemented in Go.

cappr

68
Stars
2
Forks

Completion After Prompt Probability. Make your LLM make a choice.

H2O

295
Stars
25
Forks

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

EasyKV

56
Stars
4
Forks

Easy control for Key-Value Constrained Generative LLM Inference (https://arxiv.org/abs/2402.06262)

LLaMA2

48
Stars
7
Forks

This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT) variant. The implementation focuses on the model architecture...

LMCache

4.8k
Stars
526
Forks
30
Watchers

Supercharge Your LLM with the Fastest KV Cache Layer