sglang topic

List sglang repositories

llmaz

270
Stars
44
Forks
270
Watchers

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

gpt_server

239
Stars
21
Forks
239
Watchers

gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.

Mooncake

4.3k
Stars
446
Forks
4.3k
Watchers

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

gpustack

4.1k
Stars
412
Forks
4.1k
Watchers

GPU cluster manager for optimized AI model deployment

MOSS-TTSD

1.0k
Stars
91
Forks
1.0k
Watchers

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech ge...

GPTQModel

902
Stars
130
Forks
902
Watchers

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

kvcached

682
Stars
67
Forks
682
Watchers

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

FlashTTS

557
Stars
72
Forks
557
Watchers

Provides high-quality Chinese speech synthesis and voice cloning services based on models such as SparkTTS and OrpheusTTS.

SpecForge

500
Stars
112
Forks
500
Watchers

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

InferenceMAX

383
Stars
55
Forks
383
Watchers

Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72 - DeepSeek 670B MoE, GPTOSS