InternVL icon indicating copy to clipboard operation
InternVL copied to clipboard

关于V3.5 8B模型的训练时间 the training time of of V3_5 8B model

Open MagicXiaoJing opened this issue 3 months ago • 3 comments

when I use the data to sft, there's about 1 million data consist of coco and my own data, when training with 8x80G A100 , in report it will training 1300+ hours , is it right?? or I did sth wrong with configs? 当我使用原始的数据(比如coco那些, 一共100多万)再加上自己的数据(整体5W左右)一起形成新的数据进行sft微调,使用8卡80G A100进行微调, 整体训练时间需要1300+多小时?这个合理么?还是我哪里的配置做错了? Image

Very thanks for Answering!

MagicXiaoJing avatar Sep 27 '25 06:09 MagicXiaoJing

我把pack ds 关掉了就好了

Xiaohui9607 avatar Sep 28 '25 03:09 Xiaohui9607

我把pack ds 关掉了就好了

这个和pack ds有什么关系?

MagicXiaoJing avatar Sep 28 '25 09:09 MagicXiaoJing

想请问微调脚本是哪个呢

wuzhaodongaipython avatar Oct 03 '25 07:10 wuzhaodongaipython