vLLM

Results: 2 repositories owned by vLLM

Stars: 57.1k | Forks: 9.9k | Watchers: 435

A high-throughput and memory-efficient inference and serving engine for LLMs

Stars: 3.3k | Forks: 310 | Watchers:

Cost-efficient and pluggable Infrastructure components for GenAI inference