ChenyuWang1022 comments




ChenyuWang1022

How to register a new reward manager

This might be because verl currently uses async rollout mode by default, and the current agent loop uses the reward loop under /verl/experimental/reward/reward_loop instead of the default RewardManager. You could try setting `actor_rollout_ref.rollout.mode=sync` so that the default RewardManager is used...
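To make clear which setting the suggestion changes, here is a minimal sketch of how a dotted override such as `actor_rollout_ref.rollout.mode=sync` maps onto a nested config. It assumes the option is passed as a Hydra-style `key.path=value` override; `apply_override` and the starting dictionary are hypothetical helpers written for illustration, not part of verl.

```python
def apply_override(config: dict, override: str) -> dict:
    """Apply one dotted `key.path=value` override to a nested dict in place."""
    dotted_key, value = override.split("=", 1)
    *parents, leaf = dotted_key.split(".")
    node = config
    for name in parents:
        # Walk down the nesting, creating intermediate levels if missing.
        node = node.setdefault(name, {})
    node[leaf] = value
    return config


# Hypothetical starting config: "async" stands in for the default
# described in the comment above, not a value read from verl itself.
config = {"actor_rollout_ref": {"rollout": {"mode": "async"}}}

apply_override(config, "actor_rollout_ref.rollout.mode=sync")
print(config["actor_rollout_ref"]["rollout"]["mode"])  # prints "sync"
```

The only key that changes is `mode` under `actor_rollout_ref.rollout`; any other rollout settings would be left as they were.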