verl
Any support for PEFT methods in PPO?
I am working on a project that applies RL to LLMs, but I have very limited resources. I hope the verl team can support PEFT methods such as LoRA in the PPO trainer.