reinforcement-learning-from-human-feedback topic

List reinforcement-learning-from-human-feedback repositories

Okapi

83
Stars
2
Forks
Watchers

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

safe-rlhf

1.2k
Stars
104
Forks
Watchers

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

alpaca_farm

721
Stars
56
Forks
Watchers

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

minichatgpt

15
Stars
1
Forks
Watchers

annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation

OpenRLHF

1.3k
Stars
123
Forks
Watchers

An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)

llm_optimization

21
Stars
0
Forks
Watchers

A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.