verl icon indicating copy to clipboard operation
verl copied to clipboard

Is there a way to turn off Qwen3 32B thinking mode and add a thinking budget?

Open tjoymeed opened this issue 6 months ago • 4 comments

HI Team,

Thanks a lot for your excellent work!

I have two separate questions:

  1. Is there a way in VeRL to turn off Qwen3 32B thinking mode?
  2. Is there a way in VeRl to pass in the argument of thinking budget?

Thanks!

tjoymeed avatar Jun 15 '25 02:06 tjoymeed

yeah this will be very useful to have. Having thinking budget for Qwen3 during RL tarining

mihirp1998 avatar Jun 29 '25 02:06 mihirp1998

tokenizer_config.json -> chat_template -> {%- if enable_thinking is not defined or enable_thinking is false %}

DenceChen avatar Oct 10 '25 08:10 DenceChen

tokenizer_config.json -> chat_template -> {%- if enable_thinking is not defined or enable_thinking is false %}

Will this method work when using vllm_sync for rollout? I couldn't find any implementation of vllm_sync_rollout that uses tokenizer.apply_chat_template.

Fdioa avatar Nov 12 '25 05:11 Fdioa

HI Team,

Thanks a lot for your excellent work!

I have two separate questions:

  1. Is there a way in VeRL to turn off Qwen3 32B thinking mode?
  2. Is there a way in VeRl to pass in the argument of thinking budget?

Thanks!

for question 1, set +data.apply_chat_template_kwargs='{enable_thinking:false}' \

Chenruishuo avatar Nov 25 '25 07:11 Chenruishuo