SYing
@Jeffwan SGLang: https://github.com/vllm-project/aibrix/blob/main/samples/disaggregation/sglang/tp-1p1d.yaml

I added `podGroupSize` and changed the command:

```
replicas: 2
podGroupSize: 2
```

```
python3 -m sglang.launch_server \
  --model-path /models/qwen/Qwen3-8B \
  --served-model-name qwen3-8B-tp \
  --host...
```
> Sure. I think we just need to add a vLLM multi-node example using stormservice. Is that what you are looking for?

Firstly, `podGroupSize` should be used to enable smoother...
1. The HTTPRoute should be updated when the `modelIdentifier` label on the deployment or RayClusterFleet changes (see `https://github.com/vllm-project/aibrix/blob/main/pkg/controller/modelrouter/modelrouter_controller.go#L87`). If the deployment identifier is split from the model deployment, the `modelIdentifier` label may not need to chang....
> [@ying2025](https://github.com/ying2025) I really appreciate the feedback. We will add those issues to the release story. It would be great to link the existing ones if you have already created them. We...