supermancmk
supermancmk
> > I pulled the latest version of verl's code and when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the model crashes...
> > I pulled the latest version of verl's code and when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the model crashes...
> > > > I pulled the latest version of verl's code and when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the...
> I think the verl sglang multi-turn tool calling is working btw. https://github.com/volcengine/verl/blob/54b2677/examples/sglang_multiturn/README.md但我认为 verl sglang 多轮工具调用是正常的。https://github.com/volcengine/verl/blob/54b2677/examples/sglang_multiturn/README.md May I ask if you train normally? How many steps did you train and...
Could you support qwen to use Megatron for SFT training?