verl
verl copied to clipboard
Support multi validation datasets
Current setup is to combine all validation datasets into one which assume their columns and feature types are exactly same. However, for VLM case, we want to evaluate model performance on multiple datasets which are not sharing same columns and feature types. For example, datasets with text only, datasets with images and dataset with video input.
I can work on supporting this.