Ziniu Li
Results
3
repositories owned by
Ziniu Li
ReMax
120
Stars
9
Forks
Watchers
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
RL-PPO-Keras
15
Stars
12
Forks
Watchers
Proximal Policy Optimization(PPO) with Keras Implementation
policy_optimization
23
Stars
2
Forks
Watchers
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)