verl icon indicating copy to clipboard operation
verl copied to clipboard

[Question] Does verl support muilti-round conversation RL training?

Open Jerry-hyl opened this issue 9 months ago • 1 comments

Does verl support muilti-round conversation RL training? if it does, which format should I set the dataset parquet files?

Jerry-hyl avatar Mar 04 '25 03:03 Jerry-hyl

It's not currently supported. See this issue:

https://github.com/volcengine/verl/issues/398

casper-hansen avatar Mar 04 '25 12:03 casper-hansen