Add RRHF

Open • GanjinZero opened this issue 1 year ago • 0 comments

RRHF: Rank Responses to Align Language Models with Human Feedback without tears

GanjinZero • May 04 '23 14:05