llm-serving topic

List llm-serving repositories

BulletServe

21 stars, 1 fork, 21 watchers

Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration

Cynde

15 stars, 0 forks, 15 watchers

A Framework For Intelligence Farming