About DPT depth scale and shift initialization

Open K-nowing opened this issue 2 years ago • 1 comments

Hi, thank you for code release

In the default config file, you used checkpoint "dpt_hybrid-midas", but why did you use the scale and shift value of checkpoint "dpt_hybrid-nyu" for preprocessing? I think the initial values of DPT depth affects the performance.

Apr 20 '23 14:04 K-nowing

Hi, thanks for your interest and thanks for identifying this problem. For the DPT model, we simply took the first model provided in their code and use a set of parameters they provided that can recover depth. It could probably be better if you use the corresponding fine-tuned checkpoint instead. You can also try other depth estimations as long as the depths are in reasonable scales within the sampling range.

Apr 26 '23 18:04 bianwenjing