verl
verl copied to clipboard
Support FSDP model ckpt loading and do evaluation on specific dataset: Issue #298
Can we incorporate this functionality into main_generation?
incorporate fsdp ckpt loading func into main_generation. fixes #298
any feedback?
fixed
May I kindly ask whether this pull request is anywhere closer to get merged?
It is really weird that FSDP checkpoints, in which format every official documentation and recipe recommend users to save their training artifacts, is incompatible with the verl.trainer.main_generation script. 🤦
@w568w I tested this when submitting the PR and it worked at the time. However, there hasn't been any feedback from the official team for a while. You can try this code, but I can't guarantee it will work since it's been some time since my last test.