3DCrowdNet_RELEASE
How to predict the whole person in the image?
Hi, thanks for sharing your code. I notice that this model takes a cropped and resized image as input and is trained to predict the SMPL parameters and camera parameters of one person at a time. As a result, if there is more than one person in the image, we detect each person and crop the image using the detection results. I'm wondering how to feed in the original image without cropping. However, I have a few questions about handling the dataset. Could you help me with them?
- For the camera parameters, do I need to predict them per person or per image? (An image may contain many people, and I don't intend to crop the image.)
- Which keypoints in the targets need to be changed?
- Yes. The camera parameters are actually the translation of each person in the camera frame.
- I don't understand. 3DCrowdNet is a top-down method, and cropping is required to get proper image features.
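For readers unfamiliar with the top-down setup mentioned above, here is a minimal sketch of the per-person cropping step. It is not the repository's code: the function names `crop_person` and `crop_all_people`, the `(x, y, w, h)` box format, the 256x256 output size, and the nearest-neighbor resize are all assumptions made for illustration. The point is only that each detected person becomes its own fixed-size input, which is why camera parameters end up being predicted per person.

```python
import numpy as np


def crop_person(image, bbox, out_size=(256, 256)):
    """Crop one person from an (H, W, C) image and resize the crop.

    bbox is (x, y, w, h) in pixels (assumed format). The resize uses
    nearest-neighbor sampling to keep the sketch dependency-free.
    """
    x, y, w, h = bbox
    out_h, out_w = out_size
    img_h, img_w = image.shape[:2]

    # Sample a regular grid of pixel centers inside the box.
    ys = y + (np.arange(out_h) + 0.5) * h / out_h
    xs = x + (np.arange(out_w) + 0.5) * w / out_w

    # Clamp to the image so boxes that spill over the border still work.
    ys = np.clip(np.floor(ys).astype(int), 0, img_h - 1)
    xs = np.clip(np.floor(xs).astype(int), 0, img_w - 1)

    return image[ys[:, None], xs[None, :]]


def crop_all_people(image, bboxes, out_size=(256, 256)):
    """Return one crop per detected box, stacked as (N, out_h, out_w, C)."""
    if len(bboxes) == 0:
        return np.zeros((0, *out_size, *image.shape[2:]), dtype=image.dtype)
    return np.stack([crop_person(image, b, out_size) for b in bboxes])
```

Each row of the stacked result would then be passed to the network separately (or as a batch), giving one set of SMPL and camera parameters per crop.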