Ziniu Li
Results
3
repositories owned by
Ziniu Li
ReMax
143
Stars
13
Forks
Watchers
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
RL-PPO-Keras
15
Stars
12
Forks
Watchers
Proximal Policy Optimization(PPO) with Keras Implementation
policy_optimization
23
Stars
3
Forks
Watchers
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)