scalable-oversight topic

ALaRM

Stars: 22, Forks: 2

[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"