supermancmk

5 comments by supermancmk

> > I pulled the latest version of verl's code and when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the model crashes...

> > I pulled the latest version of verl's code and when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the model crashes...

> > > > I pulled the latest version of verl's code and when running the official gsm8k with tool, multi turn async rollout sglang example without any modifications, the...

> I think the verl sglang multi-turn tool calling is working btw. https://github.com/volcengine/verl/blob/54b2677/examples/sglang_multiturn/README.md May I ask whether your training runs normally? How many steps did you train and...

Could you add support for SFT training of Qwen with Megatron?