RL
RL copied to clipboard
Add an option to save checkpoints in HF format
Is your feature request related to a problem? Please describe. Since checkpoints are often converted to HF format for evaluation, add a config to optionally save HF format checkpoints in addition to the mcore / dtensor checkpoints. This prevents the user from having to run a manual conversion step afterwards.
Describe the solution you'd like A clear and concise description of what you want to happen.
Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.
Additional context Add any other context or screenshots about the feature request here.