reward-learning topic
reward-learning repositories
imitation
1.2k
Stars
225
Forks
Watchers
Clean PyTorch implementations of imitation and reward learning algorithms
optimas
63
Stars
7
Forks
63
Watchers
Optimize Any User-defined Compound AI Systems
learning-from-rewards-llm-papers
59
Stars
2
Forks
59
Watchers
A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...