dust3r icon indicating copy to clipboard operation
dust3r copied to clipboard

About 3 steps in training

Open wuqun-tju opened this issue 1 year ago • 1 comments

Thanks for your excellent work! In the process of training, there are 3 steps, the head of first two is linear, and the last one is dpt. I wonder whether it is neccessary to train linear head? If we train dpt head directly without training linear head, is it right?

wuqun-tju avatar Jul 18 '24 06:07 wuqun-tju

Hi, from our limited experiments, we got slightly better results by training with the linear head first (I don't have numbers on hand to back it up though).

yocabon avatar Jul 18 '24 07:07 yocabon