DreamReward
DreamReward copied to clipboard
plans to open source the training code?
Very creative and interesting work. Can you open source the training code of the reward model?