reinforcement-learning-from-human-feedback topic
List
reinforcement-learning-from-human-feedback repositories
Okapi
83
Stars
2
Forks
Watchers
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
safe-rlhf
1.2k
Stars
104
Forks
Watchers
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
alpaca_farm
721
Stars
56
Forks
Watchers
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
minichatgpt
15
Stars
1
Forks
Watchers
annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation
OpenRLHF
1.3k
Stars
123
Forks
Watchers
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
llm_optimization
21
Stars
0
Forks
Watchers
A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.