Ziniu Li

Results 3 repositories owned by Ziniu Li

ReMax

120
Stars
9
Forks
Watchers

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

RL-PPO-Keras

15
Stars
12
Forks
Watchers

Proximal Policy Optimization(PPO) with Keras Implementation

policy_optimization

23
Stars
2
Forks
Watchers

Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)