Jie Cheng issues

Repositories
Issues
Comments

Results 1 issues of


                                            Jie Cheng

Improve PRM inference speed

This pr serves a similar purpose as [this](https://github.com/RLHFlow/RLHF-Reward-Modeling/pull/48), in order to increase the speed of prm evaluation. But instead of modifying the content of the conversation (which can lead to...