SYing comments

Results 64 comments of


                                            SYing

stormservice deploy use mutiple node

@Jeffwan Sglang: https://github.com/vllm-project/aibrix/blob/main/samples/disaggregation/sglang/tp-1p1d.yaml I add the podgroupsize, and I change the command ``` replicas: 2 podGroupSize: 2 ``` ``` python3 -m sglang.launch_server \ --model-path /models/qwen/Qwen3-8B \ --served-model-name qwen3-8B-tp \ --host...

[RFC]: stormservice support vllm and sglang multiple-node deployment

> Sure. I think we just need to add a vLLM multi-node example using stormservice. is that what you are looking for? Firstly, podGroupSize should be used to enable smoother...

v0.4.0 roadmap

1. Httproute should update if the deployment or rayclusterfleet label `modelIdentifier` change with `https://github.com/vllm-project/aibrix/blob/main/pkg/controller/modelrouter/modelrouter_controller.go#L87`, and if split deployment identifier from model deployment, maybe the label `modelIdentifier` not need to chang....

v0.4.0 roadmap

> [@ying2025](https://github.com/ying2025) I really appreciate the feedback. We will add those issues to the release story. It would be great to link the existing if you already create them. We...