dinov2

Unable to replicate downstream depth result on KITTI

Open zshn25 opened this issue 9 months ago • 4 comments

I evaluated the pretrained DINOv2 models with different decoder heads on the KITTI Eigen split in order to replicate the paper's numbers, but my results are much worse.

Here's what I did. I loaded the models as shown in the notebook, loaded the appropriate KITTI weights, and checked the prediction on an example KITTI image, which looks good. I then modified this evaluation script so that it neither converts disparity to depth nor scales the output, and ran the numbers.
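
For reference, the pixel selection that such an evaluation typically performs before computing metrics can be sketched as below. This is a minimal, dependency-free sketch, not the linked script: the function name `select_valid_pairs` is hypothetical, and the `1e-3` and `80.0` metre bounds are commonly used KITTI Eigen conventions that I am assuming here. As described above, there is no disparity inversion and no median scaling; the predictions are treated as metric depth.

```python
def select_valid_pairs(pred_depth, gt_depth, min_depth=1e-3, max_depth=80.0):
    """Keep only pixels with usable ground truth and clamp the predictions.

    pred_depth, gt_depth: flat sequences of per-pixel depths in metres.
    min_depth / max_depth: assumed evaluation bounds (not taken from the post).
    """
    pred_out, gt_out = [], []
    for p, g in zip(pred_depth, gt_depth):
        # Sparse LiDAR ground truth uses 0 for "no measurement"; skip those
        # pixels and anything beyond the evaluation range.
        if min_depth < g < max_depth:
            # Clamp the prediction to the same range. No rescaling is applied.
            pred_out.append(min(max(p, min_depth), max_depth))
            gt_out.append(g)
    return pred_out, gt_out
```

If the bounds or the mask differ between two evaluation setups, the resulting numbers are not directly comparable, so this step is worth checking first.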

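[Images: example_input, kitti]

| model | abs_rel | sq_rel | rmse | rmse_log | a1 | a2 | a3 |
|---|---|---|---|---|---|---|---|
| small+dpt | 0.378 | 2.788 | 7.372 | 0.336 | 0.218 | 0.866 | 0.983 |
| base+dpt | 0.392 | 3.085 | 7.963 | 0.345 | 0.179 | 0.852 | 0.986 |
| large+dpt | 0.276 | 1.938 | 6.378 | 0.267 | 0.536 | 0.927 | 0.991 |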
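
To make the column names unambiguous, here is a small sketch of the standard definitions of these seven metrics. It is plain Python over flat lists so it can be used to cross-check an array-based implementation. The function name `depth_metrics` is hypothetical and this is not the code from the evaluation script; it assumes both inputs are positive depths in metres with invalid pixels already removed.

```python
import math


def depth_metrics(pred, gt):
    """Standard monocular depth metrics over paired positive depths (metres)."""
    n = len(gt)
    pairs = list(zip(pred, gt))

    # Relative errors, normalised by the ground-truth depth.
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n
    sq_rel = sum((p - g) ** 2 / g for p, g in pairs) / n

    # Root-mean-square error in metres and in log space.
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    rmse_log = math.sqrt(
        sum((math.log(p) - math.log(g)) ** 2 for p, g in pairs) / n
    )

    # Threshold accuracies: fraction of pixels with max(p/g, g/p) < 1.25^k.
    ratios = [max(p / g, g / p) for p, g in pairs]
    a1 = sum(r < 1.25 for r in ratios) / n
    a2 = sum(r < 1.25 ** 2 for r in ratios) / n
    a3 = sum(r < 1.25 ** 3 for r in ratios) / n

    return {
        "abs_rel": abs_rel, "sq_rel": sq_rel, "rmse": rmse,
        "rmse_log": rmse_log, "a1": a1, "a2": a2, "a3": a3,
    }
```

Because `a1` only counts pixels whose ratio is within 1.25, it is more sensitive to a consistent scale offset between prediction and ground truth than `a2` and `a3` are.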

The reported RMSE values are 2.34, 2.23, and 2.14 for the small, base, and large models with DPT, respectively. Am I missing something?

zshn25 · Sep 22 '23 08:09