3DCrowdNet_RELEASE How to predict the whole person in the image?

How to predict the whole person in the image?

Open xljh0520 opened this issue 1 year ago • 1 comments

Hi, thanks for sharing your code. I notice that this model inputs the cropped and resized image and is trained to predict SMPL parameters and camera parameters once a person. As a result, if there's more than one person in the image, we detect the human and crop the image with human detection results. I'm wondering how to input the original image without cropping. However, I got a few questions in dealing with the dataset. Could you help me with it?

For the camera parameters, Do I need to predict the camera parameters per person or image? (A image may have many persons, and I don't decide to crop the image.)
Which key points in targets need to be changed?

Aug 24 '22 09:08 xljh0520

Yes. The camera parameters are actually the translation of each person in the camera frame.
I don’t understand. 3DCrowdNet is a top down method and crop is required to get proper image features

Aug 31 '22 12:08 hongsukchoi

3DCrowdNet_RELEASE 3DCrowdNet_RELEASE copied to clipboard

How to predict the whole person in the image?

3DCrowdNet_RELEASE
3DCrowdNet_RELEASE copied to clipboard