Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

add training code for reward model

Open theblackcat102 opened this issue 2 years ago • 1 comments
trafficstars

trainer code to train a single score reward model. Currently support webgpt and raw datasets from humanfeed back summary by openai. See readme and rank_datasets.py for more details.

theblackcat102 avatar Jan 01 '23 02:01 theblackcat102

@yk yeah, it's my problem. just reset the format setting

theblackcat102 avatar Jan 01 '23 13:01 theblackcat102