vllm [New Model]: qwen2-audio

🚀 The feature, motivation and pitch

Do we have a plan to support voice models like qwen-audio?

Alternatives

No response

Additional context

No response

Before submitting a new issue...

[X] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Sep 12 '24 03:09 seetimee

@fyabc are you interested in implementing this?

Sep 12 '24 04:09 DarkLight1337

@fyabc are you interested in implementing this?

Hi, our team are developing on Qwen2-Audio vllm support, please check this branch, and @faychu will take effort on it.

Sep 12 '24 04:09 fyabc

that's great

Sep 12 '24 05:09 seetimee

@fyabc are you interested in implementing this?

Hi, our team are developing on Qwen2-Audio vllm support, please check this branch, and @faychu will take effort on it.

this branch last commit is about two month ago, during that period,vllm has had several updates. Please do some new adaptation work

Sep 12 '24 07:09 seetimee

When do we expect to support qwen2 audio

Sep 14 '24 02:09 zhangfan-algo

@fyabc are you interested in implementing this?

Hi, our team are developing on Qwen2-Audio vllm support, please check this branch, and @faychu will take effort on it.

@fyabc Very excitied to see this release! Please feel free to ping me for review when the PR's up.

Sep 14 '24 07:09 ywang96

When do we expect to support qwen2 audio

We expect to support qwen2audio within one month.

Sep 18 '24 03:09 faychu

@faychu, Hi ,pro I follow your branch and have done some adaptation work with the latest version of VLLM. It worked well in a single audio per request scenario, but I am having trouble in the multi-audio per request scenario since the latest VLLM audio batch seems to have two dimensions: batch and sub-batch. Could you please give me some advice?

Sep 19 '24 08:09 seetimee

Any updates on this?

Oct 08 '24 15:10 jlia0

the latest version of VL

@faychu, Hi ,pro I follow your branch and have done some adaptation work with the latest version of VLLM. It worked well in a single audio per request scenario, but I am having trouble in the multi-audio per request scenario since the latest VLLM audio batch seems to have two dimensions: batch and sub-batch. Could you please give me some advice?

Can you share your adaptation work? Thanks

Oct 30 '24 10:10 lihuikenny

It seems qwen2-audio has already been well supported.You can just follow the https://github.com/vllm-project/vllm/pull/9248. above.

Oct 30 '24 10:10 seetimee