reward-modeling topic

List reward-modeling repositories
trafficstars

tasksource

144
Stars
7
Forks
Watchers

Datasets collection and preprocessings framework for NLP extreme multitask learning

DMoERM

15
Stars
0
Forks
Watchers

[ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling