training
training copied to clipboard
Unable to run unit tests of distributed checkpointing in Megatron-LM
dist_checkpointing.config.add_argparse_args
does not exist.
@MingjiHan99 can you share more details and reproduction steps if this is still an issue?