quanshr

Results 2 repositories owned by quanshr

DMoERM

18
Stars
0
Forks
18
Watchers

[ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

AugCon

15
Stars
0
Forks
Watchers

Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity