Gym
Gym copied to clipboard
question: how to perform multi-verifier training
Do we support training with multiple environments in NeMo Gym? For example, the model training needs both google_search and workplace_assistant? How to configure the datasets for each environment for multi-verifier training?