aibrix icon indicating copy to clipboard operation
aibrix copied to clipboard

[RFC]: stormservice support vllm and sglang multiple-node deployment

Open ying2025 opened this issue 2 months ago • 3 comments

Currently, we lack a unified multi-machine deployment solution. Raycluster currently supports multi-machine deployment using vllm, but does not support multi-machine deployment using sglang. The LeaderWorkerSet recommended by sglang lacks flexibility, so it is possible to use stormservice to support both types of inference.

ying2025 avatar Oct 11 '25 02:10 ying2025

Sure. I think we just need to add a vLLM multi-node example using stormservice. is that what you are looking for?

Jeffwan avatar Oct 11 '25 17:10 Jeffwan

Sure. I think we just need to add a vLLM multi-node example using stormservice. is that what you are looking for?

Firstly, podGroupSize should be used to enable smoother deployment of distributed inference for vllm. Secondly, the node rank requirements for sglang distributed inference should also be met.

ying2025 avatar Oct 13 '25 02:10 ying2025

@ying2025 got you. I will ask some contributors to write some concrete examples and explanation. the podGroupSize is a little bit confusing to most users. We will make sure this is clearly explained this time.

Jeffwan avatar Oct 13 '25 04:10 Jeffwan