Megatron-LM
Megatron-LM copied to clipboard
[BUG]`examples/multimodal/combine_mistral_clip.sh` Vision model file mismatch.
Describe the bug
The file format output by python examples/multimodal/clip_converter.py
does not match the file format required by examples/multimodal/combine_mistral_clip.sh
.
xxx\state_dict_tp_x.pt
is not xxx/iter_0000001/mp_rank_00/model_optim_rng.pt
?
To Reproduce
Expected behavior File format of the same name.
Stack trace/logs
Environment (please complete the following information):
- Megatron-LM commit ID: 9bcd4175becc515331537f0c78eb70079de0eaa8
- PyTorch version: 2.3.0a0+ebedce2
- CUDA version: 12.4
- NCCL version: 2.20.3
Proposed fix PR948