verl Is there a way to turn off Qwen3 32B thinking mode and add a thinking budget?

HI Team,

Thanks a lot for your excellent work!

I have two separate questions:

Is there a way in VeRL to turn off Qwen3 32B thinking mode?
Is there a way in VeRl to pass in the argument of thinking budget?

Thanks!

Jun 15 '25 02:06 tjoymeed

yeah this will be very useful to have. Having thinking budget for Qwen3 during RL tarining

Jun 29 '25 02:06 mihirp1998

tokenizer_config.json -> chat_template -> {%- if enable_thinking is not defined or enable_thinking is false %}

Oct 10 '25 08:10 DenceChen

tokenizer_config.json -> chat_template -> {%- if enable_thinking is not defined or enable_thinking is false %}

Will this method work when using vllm_sync for rollout? I couldn't find any implementation of vllm_sync_rollout that uses tokenizer.apply_chat_template.

Nov 12 '25 05:11 Fdioa

HI Team,

Thanks a lot for your excellent work!

I have two separate questions:

Is there a way in VeRL to turn off Qwen3 32B thinking mode?

Is there a way in VeRl to pass in the argument of thinking budget?

Thanks!

for question 1, set +data.apply_chat_template_kwargs='{enable_thinking:false}' \

Nov 25 '25 07:11 Chenruishuo