training icon indicating copy to clipboard operation
training copied to clipboard

Unable to run unit tests of distributed checkpointing in Megatron-LM

Open MingjiHan99 opened this issue 1 year ago • 1 comments

dist_checkpointing.config.add_argparse_args does not exist.

MingjiHan99 avatar Jul 19 '23 22:07 MingjiHan99

@MingjiHan99 can you share more details and reproduction steps if this is still an issue?

ShriyaPalsamudram avatar Jul 31 '24 15:07 ShriyaPalsamudram