lkchen
lkchen
`distributed-tests-4-gpus` already failing on [nightly build](https://buildkite.com/vllm/ci/builds/22323) and unrelated to this PR @njhill could you take a quick review?
As #13376 is merged, I think all TODOs are done, and this issue can be closed?
close in favor of #19836
`distributed-tests-4-gpus` already failing on [nightly build](https://buildkite.com/vllm/ci/builds/22323) `v1-test` already failing on [main](https://buildkite.com/vllm/ci/builds/22358#_) This PR should be mergable @njhill
@njhill v1-test passed https://buildkite.com/vllm/ci/builds/22459
Hi @NickLucche @wseaton and I both observed it [as mentioned in slack](https://vllm-dev.slack.com/archives/C08MCDXAS8Y/p1750352335005179?thread_ts=1750198629.534519&cid=C08MCDXAS8Y) I don't really know a stable setup that can reproduce it in real run, but only mimic it...
Thank you @NickLucche , I see your point, but at the moment I honestly don't have more insights how this happened
cc @Qasimk555 did you try patch https://github.com/ray-project/ray/pull/51726 ? The PR is planned to ship in 2.45 cc @GeneDer
@paolovic could you try vllm==0.8.4? 0.8.1 is known to have some bugs, and 0.8.4 fixed some security issue
https://github.com/vllm-project/vllm/pull/17084 removed sampler from model, this PR needs rebase. ~Let me see if I can help~ https://github.com/cjsdurj/vllm/pull/1