reward-learning topic

List reward-learning repositories

imitation

1.2k
Stars
225
Forks
Watchers

Clean PyTorch implementations of imitation and reward learning algorithms

optimas

63
Stars
7
Forks
63
Watchers

Optimize Any User-defined Compound AI Systems

learning-from-rewards-llm-papers

59
Stars
2
Forks
59
Watchers

A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...