kang sheng
Thanks, I've already created the pull request and tested it on my machine.
You can use mbridge to save checkpoints in the Transformers format. See this script for reference: https://github.com/volcengine/verl/blob/main/examples/grpo_trainer/run_qwen3-235b_megatron_96gb.sh#L96.
It's related to mbridge. Maybe @ISEEKYAN can help.
cc @ETOgaosion
Can you test the recommended script? https://github.com/volcengine/verl/blob/main/examples/grpo_trainer/run_qwen3moe-30b_megatron_96gb.sh