Clean PyTorch implementations of imitation and reward learning algorithms
HumanCompatibleAI
Optimize Any User-defined Compound AI Systems
snap-stanford