Multi-modal RL training support?
Will multimodal training be supported in RL?
Could you provide a list of models you would like us to support? Please also rank them. Thanks.
Hi, I think supporting LLaVA would be enough.
Qwen2-VL and Qwen2.5-VL would be great 👍
It would be great to support Qwen2-VL and Qwen2.5-VL, two popular model series for exploring the "aha moment" in MLLMs. Thank you for your great work.
Qwen2-VL and Qwen2.5-VL, please. Thanks.
The Qwen-VL series is being worked on by a member of the community. The example runs fine and will be open-sourced soon.
Can InternVL2.5 be added to the supported models?
I hope LLaVA can be supported as well, thanks.
Qwen2.5-VL is supported; see https://github.com/volcengine/verl/blob/main/examples/grpo_trainer/run_qwen2_5_vl-7b.sh for an example. Contributions from the community are highly appreciated.
Got it! Thank you for your great work!
LLaVA GRPO is supported and available at https://github.com/PRIS-CV/GRPO-for-Llava ❤️
Could you also support https://huggingface.co/Qwen/Qwen2.5-Omni-7B and https://huggingface.co/Qwen/Qwen2-Audio-7B ?
Thank you very much! Support for the audio modality would also be very helpful.
mark
Omni support is on the way :) https://github.com/volcengine/verl/issues/2852
It would also be great to see support for the new Qwen3-Omni models: https://huggingface.co/collections/Qwen/qwen3-omni-68d100a86cd0906843ceccbe. This may be complicated by those models being MoE.
I hope llava-video-qwen2 can be supported as well.