Can I use the ReFL method to train a new reward model on another dataset?

Open LOGIC-10 opened this issue 1 year ago • 1 comments

Hi, thanks for your fantastic work! I am trying to understand how to use the ReFL method to train a new reward model with a custom dataset. Specifically, I am unsure how to structure my dataset and integrate it into the current codebase.

The current code reads image files in a way that seems to rely on specific identifiers rather than absolute file paths, which makes it unclear how to plug in my own dataset. Could you give some guidance on how to make sure my dataset is correctly loaded and processed for training?

Thanks a lot!

Sep 17 '24 01:09 LOGIC-10

You can refer to this section; note that there is a make dataset function that processes the data.
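For illustration only, here is a rough sketch of how identifier-based records could be turned into file paths and preference pairs. This is not the repository's code: the record layout (`prompt`, `ranked_ids`), the `.png` extension, and the helper names `resolve_image_path`, `make_pairs`, and `missing_files` are all assumptions made up for this example, so adapt them to whatever the actual dataset function expects.

```python
import json
from itertools import combinations
from pathlib import Path


def resolve_image_path(image_root, image_id, ext=".png"):
    """Map a bare identifier to a file under image_root (hypothetical layout)."""
    return Path(image_root) / f"{image_id}{ext}"


def load_records(json_path):
    """Load a list of records from a JSON file (hypothetical format)."""
    with open(json_path, encoding="utf-8") as f:
        return json.load(f)


def make_pairs(records, image_root):
    """Expand ranked records into (prompt, better_path, worse_path) tuples.

    Each record is assumed to look like
    {"prompt": str, "ranked_ids": [best_id, ..., worst_id]}.
    """
    pairs = []
    for rec in records:
        paths = [resolve_image_path(image_root, i) for i in rec["ranked_ids"]]
        # combinations() keeps input order, so `better` always outranks `worse`.
        for better, worse in combinations(paths, 2):
            pairs.append((rec["prompt"], better, worse))
    return pairs


def missing_files(pairs):
    """Return the sorted list of referenced image paths that do not exist."""
    referenced = {p for _, better, worse in pairs for p in (better, worse)}
    return sorted(p for p in referenced if not p.exists())
```

Running `missing_files(make_pairs(load_records("my_data.json"), "my_images/"))` before training (both paths are placeholders) would list any identifiers that do not resolve to a file, which may help when checking that a custom dataset is laid out as intended.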

Oct 03 '24 02:10 xujz18