rlaif topic
rlaif repositories
awesome-RLAIF
102 stars, 4 forks
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
distilabel
1.0k stars, 65 forks
⚗️ distilabel is a framework for synthetic data and AI feedback, built for AI engineers who require high-quality outputs, full data ownership, and overall efficiency.
zero-shot-reward-models
31 stars, 7 forks
ZYN: Zero-Shot Reward Models with Yes-No Questions
Prompt-OIRL
25 stars, 5 forks
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"