scalable-oversight topic
List
scalable-oversight repositories
ALaRM
22
Stars
2
Forks
Watchers
[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"