coax icon indicating copy to clipboard operation
coax copied to clipboard

Example of using this lib for RLHF?

Open asmith26 opened this issue 2 years ago • 0 comments

Just wondering if there are any example of using this lib for implement RLHF (Reinforcement Learning from Human Feedback)?

Inspired by: https://openai.com/blog/chatgpt image

Many thanks for any help! :)

asmith26 avatar Apr 04 '23 17:04 asmith26