mmpose icon indicating copy to clipboard operation
mmpose copied to clipboard

td-hm_hrnet-w32_coco doubt?

Open chen-del opened this issue 3 years ago • 2 comments

Hello! I found that dev1.0 has changed a lot, but I have doubts about the pose output, what does its output mean, and its post-processing code is encapsulated. Where is the post-processing code for the actual reasoning? Please answer your doubts, thank you! image

chen-del avatar Nov 22 '22 07:11 chen-del

Hi @chen-del , thanks for using MMPose 1.0 ! According to your screenshot, the output feature is in shape [1, 17, 96, 72], which corresponds to 17 keypoint heatmaps, each with a size of 96x72. In MMPose 1.0, the post-processing operations are defined in the codec, which is a new module to encode/decode the targets. You can refer to this tutorial for details.

Tau-J avatar Nov 22 '22 07:11 Tau-J

Ok, thank you, but I saw that bbox is actually used in it, but I don’t know where to call it. If you want to use it, does poseonnx still need a pedestrian detection model to use bbox to return the coordinates of people?

chen-del avatar Nov 22 '22 09:11 chen-del

@chen-del Sorry for late reply. Yes, for top-down algorithms, a pedestrain detector is needed.

Tau-J avatar Feb 21 '23 08:02 Tau-J