HaochenZhao
Results
1
issues of
HaochenZhao
I am working on a project about applying RL to LLM but only have very limited resource. Hope verl team can support peft methods like lora on your ppo trainer