JunchenHuang777

Results 2 comments of JunchenHuang777

我也存在同样问题,相同数据集和配置,qwen3vl-8b 全参sft时长比qwen2.5vl-7b增加3倍以上

System Info gpu:8×H100 cuda:12.3 python3.10 **Package Version** accelerate 1.10.1 aiofiles 24.1.0 aiohappyeyeballs 2.6.1 aiohttp 3.13.1 aiosignal 1.4.0 annotated-types 0.7.0 antlr4-python3-runtime 4.9.3 anyio 4.11.0 async-timeout 5.0.1 attrs 25.4.0 audioread 3.0.1 av...