reward-modeling topic
List
reward-modeling repositories
trafficstars
tasksource
144
Stars
7
Forks
Watchers
Datasets collection and preprocessings framework for NLP extreme multitask learning
DMoERM
15
Stars
0
Forks
Watchers
[ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling