DFormer icon indicating copy to clipboard operation
DFormer copied to clipboard

Congratulations on CVPR Acceptance! A Few Questions about DFormer-V2 Training and DELIVER Dataset Support

Open alkaidzc opened this issue 8 months ago • 3 comments

First of all, heartfelt congratulations on the acceptance of DFormer-V2 at CVPR! Your work is truly inspiring, and I have learned a lot from your previous V1 version.  I have a couple of questions and would greatly appreciate your guidance:  Training Setup: Was the DFormer-V2 model trained using two RTX 3090 GPUs as in the previous version? If possible, could you kindly share the hardware configuration and approximate training time for both pretraining and fine-tuning?  DELIVER Dataset Support: I noticed that you’ve included evaluation results on the DELIVER dataset in your latest paper. May I ask if you have any plans to update the codebase to support the DELIVER dataset in the future?  Thank you very much for your time and for your valuable contributions to the field. Wishing you continued success and even greater achievements in your research!

alkaidzc avatar Apr 21 '25 13:04 alkaidzc

Thank you for your interest in our work!

The DFormerv2-S model was fine-tuned using two NVIDIA 3090 GPUs, while larger model scales required four 3090 GPUs for optimization. Fine-tuning durations spanned approximately 10, 15, and 20 hours for the respective model sizes, with pretraining phases taking about 80, 110, and 150 hours on eight 3090 GPUs. Regarding the Deliver dataset implementation, while the initial training utilized another framework, we are actively refactoring the code to ensure seamless integration with our main codebase. If you are in a hurry, you can also apply our model to the deliver framework. We will update the unified version once we finish and will notify you.

Wishing you success in your research endeavors!

yinbow avatar Apr 22 '25 04:04 yinbow

你好,我想问一下验证的时候显存溢出了,用的两张5090,训练集就1000张,这是正常的嘛

intxxx avatar Oct 16 '25 13:10 intxxx

你好,我想问一下验证的时候显存溢出了,用的两张5090,训练集就1000张,这是正常的嘛

验证时显存溢出,应该检查验证集图片的尺寸和验证时的batch size, val_batch_sizethis line,图片较大的话可以考虑调小val_batch_size

yinbow avatar Oct 16 '25 16:10 yinbow