How to apply to videos in the wild?

Open hellojialee opened this issue 4 years ago • 1 comments

Hi, thanks for your great work. I have just entered into the 3D human pose. I admire your work. Your paper estimates Z/f using weak perspective model, which SMAP estimates Zw/f, in which Z is the original depth, and f and w are the focal length and the image width both in pixels. Which is better? I think your estimation is in the real space and theirs is in a normalization space.

[1] SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation

Jan 13 '22 09:01 hellojialee

PS. Both of you aim to estimate the distance in meters.

Jan 13 '22 09:01 hellojialee