RL4LMs
RL4LMs copied to clipboard
In the paper, what is the detail setting of supervised learning? Is SL has additional supervised data?
https://openreview.net/forum?id=8aHzds2uUyB
Thank you very much!
https://openreview.net/forum?id=8aHzds2uUyB
Thank you very much!