GnTCN
GnTCN copied to clipboard
How to apply to videos in the wild?
Hi, thanks for your great work. I have just entered into the 3D human pose. I admire your work. Your paper estimates Z/f using weak perspective model, which SMAP estimates Zw/f, in which Z is the original depth, and f and w are the focal length and the image width both in pixels. Which is better? I think your estimation is in the real space and theirs is in a normalization space.
[1] SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation
PS. Both of you aim to estimate the distance in meters.