HaochenZhao

Results 1 issues of HaochenZhao

I am working on a project about applying RL to LLM but only have very limited resource. Hope verl team can support peft methods like lora on your ppo trainer