[BUG] `examples/multimodal/combine_mistral_clip.sh`: vision model file mismatch

Open • Baibaifan opened this issue 7 months ago • 1 comment

Describe the bug The file layout produced by `python examples/multimodal/clip_converter.py` does not match the layout required by `examples/multimodal/combine_mistral_clip.sh`. The converter writes `xxx/state_dict_tp_x.pt`, but the script appears to require `xxx/iter_0000001/mp_rank_00/model_optim_rng.pt`. Is this mismatch intended?
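To make the mismatch concrete, here is a minimal sketch that reports which of the two layouts is present under a checkpoint directory. It uses only the file names quoted in this report. The helper names (`expected_checkpoint_path`, `converter_output_path`, `describe_layout`) are hypothetical and not part of Megatron-LM, and mapping tensor-parallel rank `x` to `mp_rank_0x` is an assumption.

```python
from pathlib import Path


def expected_checkpoint_path(root: Path, tp_rank: int = 0, iteration: int = 1) -> Path:
    """Layout this report says combine_mistral_clip.sh requires.

    Assumption: the tensor-parallel rank maps to a zero-padded mp_rank_XX directory.
    """
    return root / f"iter_{iteration:07d}" / f"mp_rank_{tp_rank:02d}" / "model_optim_rng.pt"


def converter_output_path(root: Path, tp_rank: int = 0) -> Path:
    """Layout this report says clip_converter.py writes."""
    return root / f"state_dict_tp_{tp_rank}.pt"


def describe_layout(root: Path, tp_rank: int = 0) -> str:
    """Return 'megatron', 'converter', or 'missing' for the given directory."""
    if expected_checkpoint_path(root, tp_rank).is_file():
        return "megatron"
    if converter_output_path(root, tp_rank).is_file():
        return "converter"
    return "missing"
```

Running `describe_layout(Path("xxx"))` on the converter's output directory would return `"converter"` in the situation described here, which is the case the combine script cannot consume.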

To Reproduce

Expected behavior The converter should write its output using the same file names and directory layout that the combine script reads.

Stack trace/logs

Environment (please complete the following information):

  • Megatron-LM commit ID: 9bcd4175becc515331537f0c78eb70079de0eaa8
  • PyTorch version: 2.3.0a0+ebedce2
  • CUDA version: 12.4
  • NCCL version: 2.20.3

Proposed fix PR948

Additional context

Baibaifan, Jul 25 '24 05:07