Chunjiang Ge (葛春江)

Results 39 comments of Chunjiang Ge (葛春江)

Some papers that follow our work may have developed methods for segmentation. You could check them.

This warning does not affect performance. How the extra special tokens you added affect performance depends on how you handle them.

If you set fps for training, the video can be read directly and frames are sampled from it for training.
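To illustrate what sampling frames at a target fps means, here is a minimal sketch. It is not the repository's actual loader: the function name `sample_frame_indices` and its parameters are hypothetical, and it only computes which frame indices would be kept, assuming frames are picked at evenly spaced positions.

```python
def sample_frame_indices(total_frames: int, video_fps: float, target_fps: float) -> list[int]:
    """Pick evenly spaced frame indices so the kept frames approximate target_fps.

    Hypothetical helper for illustration only; the real training code may
    sample differently (for example, with min/max frame limits).
    """
    if total_frames <= 0 or video_fps <= 0 or target_fps <= 0:
        raise ValueError("total_frames, video_fps and target_fps must be positive")

    # Keep one frame every `step` source frames; never upsample (step >= 1).
    step = max(video_fps / target_fps, 1.0)

    indices = []
    pos = 0.0
    while int(pos) < total_frames:
        indices.append(int(pos))
        pos += step
    return indices


if __name__ == "__main__":
    # A 10-frame clip recorded at 30 fps, sampled at 10 fps.
    print(sample_frame_indices(10, 30.0, 10.0))
```

With this sketch, a lower target fps yields fewer frames per video, which is the knob that trades temporal detail for training cost.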

3vl uses relative coordinates; you can refer to how 2vl handles them.
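As a rough sketch of what relative coordinates mean for a bounding box, the snippet below rescales absolute pixel coordinates by the image size. The helper name `to_relative_box` is hypothetical, and the `scale=1000` default is an assumption made for illustration; check the 2vl reference for the exact range and format expected.

```python
def to_relative_box(box, width, height, scale=1000):
    """Convert an absolute-pixel box (x1, y1, x2, y2) to relative coordinates.

    Hypothetical helper. `scale` is an assumed normalization range
    (0..scale); the value the model actually expects may differ.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")

    x1, y1, x2, y2 = box
    return [
        round(x1 / width * scale),
        round(y1 / height * scale),
        round(x2 / width * scale),
        round(y2 / height * scale),
    ]


if __name__ == "__main__":
    # A box in a 640x480 image.
    print(to_relative_box((64, 48, 320, 240), 640, 480))
```

Because the result no longer depends on the image resolution, the same annotation stays valid after the image is resized.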

You could speed up training by setting:

```bash
--data_flatten True \
```

and by reducing the max pixels.

Training with ViT-B should use AdamW as the optimizer. You could also try a few different lr settings.