llm-serving topic
llm-serving repositories
BulletServe
Stars: 21, Forks: 1, Watchers: 21
Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration
Cynde
Stars: 15, Forks: 0, Watchers: 15
A Framework For Intelligence Farming