rlaif topic

List rlaif repositories

awesome-RLAIF

102
Stars
4
Forks
Watchers

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

distilabel

1.0k
Stars
65
Forks
Watchers

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

zero-shot-reward-models

31
Stars
7
Forks
Watchers

ZYN: Zero-Shot Reward Models with Yes-No Questions

Prompt-OIRL

25
Stars
5
Forks
Watchers

code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning