RL-Restore
RL-Restore copied to clipboard
Problems when training on my own dataset
Hi, @yuke93
Thank you for sharing your codes. But I found that the reward of testset keeps zero when training on my own data. I have checked that there is no any problem about my data prepare.
Could you give some advice for this? Thank you in advance.