heli-qi
Results: 2 comments of heli-qi
Thank you for your brilliant work! I’m interested in using Gemma3 as the base model for RL training, and I have tested the code committed in your PR on my own machine....
Thank you for your suggestion! I have downgraded transformers to 4.51.3 and observed reasonable rollout output after setting `actor_rollout_ref.rollout.load_format=auto`. Also, I noticed your latest commit and I have...