verl
verl copied to clipboard
The set parameter actor.optim.lr does not work
the initialization is:
but when update the actor, the optimizer is changed to:
Did you resume checkpoint somehow?
Did you resume checkpoint somehow?
Thanks,I have solved the problem.
How did you solved this?