Wuyingwen

Results: 2 issues by Wuyingwen

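### Question

In the diagram, the third stage of MoE shows the projector (MLP) as not being trained, but in the actual code the QWen-Stage2 pretrained model has freeze_mm_mlp_adapter=False, which means the mm_projector parameters are also updated in the third stage. How should this discrepancy be explained?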

**Describe the bug** When fine-tuning the MiniCPM-v-2.6 model with the `swift sft` command, training suddenly fails partway through with the error: Some NCCL operations have failed or timed out. Due to the asynchronous nature of CUDA kernels, subsequent GPU operations might run on corrupted/incomplete data....

bug