reinforcement-learning-from-human-feedback topic

Repositories tagged with the reinforcement-learning-from-human-feedback topic

Okapi

90 Stars · 2 Forks

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

safe-rlhf

1.3k Stars · 119 Forks

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
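Safe RLHF frames alignment as maximizing a learned reward subject to a constraint on a separately learned cost model, typically solved with a Lagrange multiplier. The sketch below illustrates that Lagrangian objective only; the function and argument names are illustrative assumptions, not the repo's API, and the surrounding PPO machinery is omitted.

```python
import torch

def safe_rlhf_objective(reward, cost, log_lambda, cost_limit=0.0):
    """Lagrangian form of a constrained RLHF objective:
    maximize E[reward] subject to E[cost] <= cost_limit.

    reward, cost: tensors of shape (B,) from separate reward and cost models
    log_lambda:   learnable scalar parameter (log of the Lagrange multiplier)
    Returns a policy loss and a multiplier loss, both to be minimized.
    """
    lam = log_lambda.exp()
    constraint = cost.mean() - cost_limit  # > 0 means the constraint is violated
    # Policy maximizes reward penalized by the (detached) multiplier times the violation
    policy_loss = -(reward.mean() - lam.detach() * constraint)
    # Minimizing this performs gradient ascent on lambda when the constraint is violated
    lambda_loss = -(lam * constraint.detach())
    return policy_loss, lambda_loss
```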

alpaca_farm

766 Stars · 60 Forks

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
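The idea of developing RLHF methods without human data rests on simulated preference annotators. A minimal sketch of that pattern is shown below; `judge_llm` is a hypothetical callable standing in for an API-backed judge, and the label-noise rate is an illustrative assumption rather than the framework's setting.

```python
import random

def simulate_pairwise_preference(prompt, response_a, response_b, judge_llm):
    """Label a preference pair with a simulated (LLM) annotator instead of a human.

    judge_llm: hypothetical callable that returns 'A' or 'B' for a comparison prompt.
    Returns 0 if response_a is preferred, 1 if response_b is preferred.
    """
    query = (
        f"Instruction:\n{prompt}\n\n"
        f"Response A:\n{response_a}\n\n"
        f"Response B:\n{response_b}\n\n"
        "Which response is better? Answer with 'A' or 'B'."
    )
    choice = judge_llm(query)
    # Flip a small fraction of labels to mimic human annotator disagreement
    if random.random() < 0.1:
        choice = "B" if choice == "A" else "A"
    return 0 if choice == "A" else 1
```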

minichatgpt

18 Stars · 1 Fork

An annotated tutorial of the Hugging Face TRL repo for reinforcement learning from human feedback, connecting the PPO and GAE equations to the corresponding lines of code in the PyTorch implementation.
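The GAE recursion the tutorial walks through is compact enough to show directly. Below is a minimal, self-contained sketch of Generalized Advantage Estimation; the function and variable names are my own and do not correspond to TRL's internals.

```python
import torch

def compute_gae(rewards, values, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation over a single trajectory.

    rewards: tensor of shape (T,) with per-step rewards
    values:  tensor of shape (T + 1,) with value estimates V(s_0), ..., V(s_T)
    Implements the backward recursion
      delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
      A_t     = delta_t + gamma * lam * A_{t+1}
    """
    T = rewards.shape[0]
    advantages = torch.zeros(T)
    gae = 0.0
    for t in reversed(range(T)):
        delta = rewards[t] + gamma * values[t + 1] - values[t]
        gae = delta + gamma * lam * gae
        advantages[t] = gae
    returns = advantages + values[:-1]  # regression targets for the value head
    return advantages, returns
```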

OpenRLHF

2.1k Stars · 206 Forks

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
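Alongside PPO, the framework lists iterative DPO among its training modes. For orientation, here is a minimal sketch of the standard DPO loss on a batch of preference pairs; the signature is illustrative and not OpenRLHF's API.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Direct Preference Optimization loss.

    Each argument is a tensor of shape (B,) holding the summed log-probability
    of the chosen/rejected response under the trainable policy or the frozen
    reference model.
    """
    # Implicit rewards: scaled log-ratios of policy to reference
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize the margin between chosen and rejected implicit rewards
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```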

CodeUltraFeedback

72 Stars · 5 Forks · 72 Watchers

CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)

llm_optimization

28 Stars · 2 Forks

A repo for RLHF training and best-of-n (BoN) sampling over LLMs, with support for reward model ensembles.
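As a quick illustration of combining best-of-n sampling with a reward model ensemble, the sketch below scores each candidate under every ensemble member and picks the winner under a conservative aggregate (the minimum), a common way to reduce reward over-optimization. Names and signatures are assumptions for illustration, not the repo's API.

```python
import torch

def best_of_n(candidates, reward_models, aggregate="min"):
    """Select one response from n sampled candidates using an ensemble of reward models.

    candidates:    list of n strings sampled from the policy for one prompt
    reward_models: list of callables, each mapping a string to a scalar score
    aggregate:     'min' (conservative) or 'mean' over the ensemble scores
    """
    # scores[i, j] = score of candidate j under reward model i
    scores = torch.tensor([[rm(c) for c in candidates] for rm in reward_models])
    if aggregate == "min":
        combined = scores.min(dim=0).values
    else:
        combined = scores.mean(dim=0)
    return candidates[combined.argmax().item()]
```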

ReaLHF

95 Stars · 4 Forks

Super-Efficient RLHF Training of LLMs with Parameter Reallocation