dinov2
dinov2 copied to clipboard
Unable to replicate downstream depth result on KITTI
I evaluate the pretrained dinov2 with different decoder heads on KITTI Eigen Split in order to replicate the paper's numbers. I found the results much worse.
Here's what I did. I load the models as shown in the notebook. I load the appropriate KITTI weights and check on an example KITTI image. Looks good. I modified this evaluation script to not convert disparity to depth and to not scale the output and ran the numbers.
![]() |
model | abs_rel | sq_rel | rmse | rmse_log | a1 | a2 | a3 |
---|---|---|---|---|---|---|---|
small+dpt | 0.378 | 2.788 | 7.372 | 0.336 | 0.218 | 0.866 | 0.983 |
base+dpt | 0.392 | 3.085 | 7.963 | 0.345 | 0.179 | 0.852 | 0.986 |
large+dpt | 0.276 | 1.938 | 6.378 | 0.267 | 0.536 | 0.927 | 0.991 |
The reported RMSE is 2.34, 2.23, 2.14 for the small, base and large models with DPT respectively. Am I missing something?