Roger Wang comments

Results 132 comments of


                                            Roger Wang

[Bug]: PaliGemma detection task is failing

Thanks for reporting! I will take a look at this issue.

[Bug]: PaliGemma detection task is failing

Hey I just want to let you know this is still on my list of TODO but I simply had limited bandwidth with other priorities. Sorry for the inconvenience!

[RFC][V1] `LogitsProcessor` interface

@njhill Can we close this PR?

[Bug]: some questions regarding the usage of NCCL allreduce/broadcast/allgather/send/recv in VLLM using pycomm and torch's distributed.

@youkaichao I think you might have the most context in this?

[Core][VLM] Add support for placeholder token content hashes

Sorry for the delay - I was busy with Pixtral release last week but will review this PR this week!

[Core][VLM] Add support for placeholder token content hashes

@cooleel We decided to work on adding prefix caching for multimodal models on V1 instead since there are some fundamental changes on how cache manager is designed. Stay tuned and...

[RFC]: Deprecate stop_reason in OpenAI Entrypoint in favor of finish_reason; fix implementation of finish_reason

Thank you for this RFC - imo we should support OpenAI compatibility at our best effort so working on `finished_reason` aligns with our goal. I do think there's a value...

[RFC]: Support for video input

> > Is there a roadmap for 'openai vllm server supports video interface' feature? Thank you > > Thank you. It is a very critical feature, but it needs a...

[RFC]: Support for video input

> @ywang96 I am quite interested in using `vllm` for high-performance video captioning. This will be tremendously helpful for furthering research on video generation from language. > > I was...

[RFC]: Support for video input

> > AFAIK, only Llava-OneVision supports multi-video captioning once #8905 is merged, and this is more of a model capability than of the inference infrastructure capability > > Agree. Thanks...