LLM-with-RL-papers

Published 20 hours ago

Add RRHF

Open • GanjinZero opened this issue 1 year ago • 0 comments

RRHF: Rank Responses to Align Language Models with Human Feedback without tears

May 04 '23 14:05 GanjinZero