Kingsley
you can refer to https://github.com/hiyouga/LLaMA-Factory/issues/8365#issuecomment-3251631509
It is not recommended to use LLaMA-Factory for PPO with MoE-type models.
> Or, for multi-GPU parallelism, is there a configuration that splits the tokens into four parts and distributes them across the four GPUs?

Sequence parallelism will be supported in the next version.
I reproduced your experiment with the following config without hitting this issue. Hardware env: 2× V100.

```yaml
### model
model_name_or_path: ./Qwen2.5-Omni-7B
image_max_pixels: 262144
video_max_pixels: 16384
trust_remote_code: true

### method
stage: sft
do_train: ...
```
I am unfamiliar with MLLM role-playing, but I guess it is inconsistent with model training because these MLLMs use a fixed textual system prompt (see their chat templates). Can...
Thanks for your explanation. I will check it later. I just added some prefix images to the user column and restarted this experiment without hitting the issue. I think it should be...
> An alternative is to put this role instruction in the first `user` message of the dialogue and have the `assistant` reply "OK". The real chat then begins from...
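The workaround quoted above can be sketched as a small preprocessing step. This is only an illustration, assuming sharegpt-style records (a `system` field plus a `conversations` list of `from`/`value` turns) as used in LLaMA-Factory datasets; the helper name `inline_role_instruction` is made up here.

```python
def inline_role_instruction(record: dict) -> dict:
    """Move the role instruction out of the `system` field and into the
    first user turn, with the assistant replying "OK" (a sketch, not the
    library's own API)."""
    system = record.pop("system", "")
    if system:
        prefix = [
            {"from": "human", "value": system},
            {"from": "gpt", "value": "OK"},
        ]
        record["conversations"] = prefix + record["conversations"]
    return record


# Hypothetical sample record for illustration.
sample = {
    "system": "You are a pirate captain. Stay in character.",
    "conversations": [
        {"from": "human", "value": "Who are you?"},
        {"from": "gpt", "value": "Arr, I be the captain!"},
    ],
}

converted = inline_role_instruction(sample)
print(converted["conversations"][0]["value"])  # the former system prompt
print(converted["conversations"][1]["value"])  # "OK"
```

Because the role instruction now travels as an ordinary user turn, the same idea extends to multimodal content (e.g. prefix images in the first user message), which a fixed textual system prompt cannot carry.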
> Thank you for your experiment.
> But I still want to know how to include mmdata in sys_msg when training. 😂

Is it too hard? If it's too hard, then...
> I'm now converting my videos into vision_only_videos following [#7638](https://github.com/hiyouga/LLaMA-Factory/pull/7638). And I guess the hanging problem with my own dataset is mainly caused by this. Hope I will come back...