vllm topic

List vllm repositories

kubeai

361
Stars
33
Forks

Private Open AI on Kubernetes

LightCompress

625
Stars
62
Forks
625
Watchers

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

vidur

241
Stars
27
Forks

A large-scale simulation framework for LLM inference

harbor

396
Stars
19
Forks

Effortlessly run LLM backends, APIs, frontends, and services with one command.

lm-fly

16
Stars
4
Forks

Inference framework acceleration for large models: make LLMs fly

prometheus-eval

1.0k
Stars
66
Forks
1.0k
Watchers

Evaluate your LLM's response with Prometheus and GPT4 💯

ramalama

178
Stars
19
Forks

The goal of ramalama is to make working with AI boring.

llmaz

270
Stars
44
Forks
270
Watchers

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

nextjs-vllm-ui

46
Stars
8
Forks

Fully-featured, beautiful web interface for vLLM - built with NextJS.

grps

147
Stars
13
Forks

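[Deep learning model deployment framework] Supports tf/torch/trt/trtllm/vllm and other NN frameworks, supports dynamic batching and streaming modes, supports both Python and C++, and is limitable, extensible, and high-performance. Helps users quickly deploy models online and, via http/rpc ...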