RL4LMs icon indicating copy to clipboard operation
RL4LMs copied to clipboard

Do you have any plans to apply the recently published Reinforced Self-Training (ReST)?

Open missflash opened this issue 1 year ago • 0 comments

Do you have any plans to apply the recently published Reinforced Self-Training (ReST)?

Reinforced Self-Training (ReST) for Language Modeling https://arxiv.org/abs/2308.08998

missflash avatar Sep 08 '23 02:09 missflash