3d_pose_baseline_pytorch icon indicating copy to clipboard operation
3d_pose_baseline_pytorch copied to clipboard

coordinate space of GT 2d pose

Open kyang-06 opened this issue 6 years ago • 9 comments

Hi, I notice that the 2d pose ground truth in *_2d.pth.tar lies in original image space(1002x1000) after unnormalizing, which is not as expected as cropped image space in 256x256. Is this the same as the original version? BTW, I regenerated the hm36 data with cropped 2d pose, and just achieved error of 48mm. I wonder if this difference lead to the performance gap. THX!

kyang-06 avatar May 27 '19 05:05 kyang-06

same as the original repo, the 2d pose is obtained from the 1000x1000 image, which means that the unnormalized 2d pose lies in 1000x1000.

weigq avatar May 27 '19 06:05 weigq

Thanks for quick response! So that is to say, the author's implementation in tensorflow does not match what they described in the paper? ( 440x440 -> 256x256 in the paper)

kyang-06 avatar May 27 '19 06:05 kyang-06

I do not know the details how they generate the 2d detections, however, i think, they crop the image when running the staked hourglass. And the final 2d pose they provide is aligned back to 1000x1000 image.

weigq avatar May 27 '19 08:05 weigq

One more question about the corresponding relation between 2d and 3d joints. Is the 2d input the same as orign repo that includes hip and excludes nose? If so, it's not an one-to-one regression as orign repo. The hip-based 3d pose has an extra nose joint.

kyang-06 avatar May 31 '19 06:05 kyang-06

in fact, same as the origin repo, the mapping is not one-to-one (the jaw joint only appears in 3D pose), most 3D pose estimation methods also do as this.

weigq avatar Jun 01 '19 00:06 weigq

@weigq I am wondering that if the inputed 2d pose can be arbitrarily size, which is related with the image size. should I resize the 2d joints into a regular size before I run the inference?

wuhailang911 avatar May 12 '20 09:05 wuhailang911

@ky66111 @weigq Could you share the idea of unnormalizing the 2d inputs? I cannot find the mean and std of 2d poses in the given files. Thanks!

lee-man avatar Jun 25 '21 02:06 lee-man

@lee-man Were you able to unnormalize the 2d data? I'm having the same issue as you.

andresherrera97 avatar Jul 29 '21 13:07 andresherrera97

@lee-man Were you able to unnormalize the 2d data? I'm having the same issue as you.

@andresherrera97 Sorry that I missed your reply. I can send you the statistics of 2d for unnormalization if you still need it.

lee-man avatar Sep 16 '21 07:09 lee-man