aibrix [RFC]: StormService support for vLLM and SGLang multi-node deployment

We currently lack a unified multi-node deployment solution. RayCluster supports multi-node deployment with vLLM, but not with SGLang. The LeaderWorkerSet approach recommended by SGLang lacks flexibility, so StormService could be used to support both inference engines.

Oct 11 '25 02:10 ying2025

Sure. I think we just need to add a vLLM multi-node example using StormService. Is that what you are looking for?

Oct 11 '25 17:10 Jeffwan

> Sure. I think we just need to add a vLLM multi-node example using StormService. Is that what you are looking for?

First, podGroupSize should be used to enable smoother deployment of distributed inference for vLLM. Second, the node rank requirements for SGLang distributed inference should also be met.
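
To make the node-rank requirement concrete, here is a minimal sketch of how each pod in a group could build its SGLang launch command. It assumes that podGroupSize maps to the number of nodes and that every pod knows its own index within the group and the leader's address. The function name, its parameters, the port, and the way the index is obtained are all hypothetical, not StormService's actual API. Only the `sglang.launch_server` flags (`--nnodes`, `--node-rank`, `--dist-init-addr`, `--tp`) come from SGLang itself.

```python
from typing import List


def sglang_launch_args(
    pod_index: int,        # hypothetical: this pod's index within its pod group
    pod_group_size: int,   # assumed to correspond to podGroupSize
    leader_addr: str,      # hypothetical: address of the pod with index 0
    model_path: str,
    tp_size: int,
    port: int = 5000,      # arbitrary port chosen for illustration
) -> List[str]:
    """Build the argv for one node of a multi-node SGLang server."""
    if pod_group_size < 1:
        raise ValueError("pod_group_size must be at least 1")
    if not 0 <= pod_index < pod_group_size:
        raise ValueError("pod_index must be in [0, pod_group_size)")

    return [
        "python3", "-m", "sglang.launch_server",
        "--model-path", model_path,
        "--tp", str(tp_size),
        # Every node must agree on the total node count.
        "--nnodes", str(pod_group_size),
        # Each node needs a unique rank; rank 0 acts as the leader.
        "--node-rank", str(pod_index),
        # All nodes rendezvous at the leader's address.
        "--dist-init-addr", f"{leader_addr}:{port}",
    ]
```

For a group of two pods, the pod with index 1 would call `sglang_launch_args(1, 2, leader_addr, model_path, tp_size)` and get `--node-rank 1 --nnodes 2`. The point is that the controller has to expose a stable per-pod index and a resolvable leader address for this to work.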

Oct 13 '25 02:10 ying2025

@ying2025 Got you. I will ask some contributors to write concrete examples and explanations. podGroupSize is a little confusing to most users. We will make sure it is clearly explained this time.

Oct 13 '25 04:10 Jeffwan