SmoothNet icon indicating copy to clipboard operation
SmoothNet copied to clipboard

What is the experimental setup for human mesh recovery?

Open HospitableHost opened this issue 1 year ago • 9 comments

This table is from your paper. image In the paper, you mentioned that "SmoothNet is trained with the pose outputs from SPIN [22]. We test its performance across multiple backbone networks." But you didn't describe the experimental configuration. So which dataset are you using poses from? And what size is your sliding window here? Also, did you evaluate the metrics using the test set of the three datasets? Because I run your eval code, but the results of vibe is different from this table. image (data from pw3d_vibe_smpl_test.npz and pw3d_gt_smpl_test.npz)(and the vibe results are not affected by smoothnet pre-trained models)

HospitableHost avatar Jun 25 '23 04:06 HospitableHost

In addition, I noticed that the data from data/poses/pw3d_vibe_smpl contains 37 sequences. But this does not match with the 3DPW dataset, which has 24 sequences in test_set, 12 sequences in vald_set and 24 sequences in train_set. image

HospitableHost avatar Jun 25 '23 06:06 HospitableHost

Hi @HospitableHost ,

We used the 3dpw-spin for training. The differences may come from that we removed a few frames at the end of each period that cannot be divided by sliding windows. Did you use our provided code and data for the test?

For 37 sequences, there are some sequences that have two persons. We simply split them into two sequences.

ailingzengzzz avatar Jun 25 '23 08:06 ailingzengzzz

@ailingzengzzz Hi, I read your code carefully and tested with your code and data. (data/poses/pw3d_vibe_smpl/...) Therefore, I don't know what caused the VIBE results to be different from your paper. image image

By the way, I still have three questions: first, did you use spin---3dpw_train_set for training, right? second, do the 37 sequences correspond to the 3dpw_test_set? third, could you please open-source the pretrained model: 3DPW-SPIN-SMPL (not 3DPW-SPIN-3D)

best wishes!

HospitableHost avatar Jun 26 '23 08:06 HospitableHost

Hi @HospitableHost,

The answer to questions 1 and 2 is yes. We explored the 3DPW-SPIN-SMPL for SMPL (6d rotation matrix) testing but found it is inferior to 3DPW-SPIN-3D (see the paper). Thus, we only provided the 3d model for cross-modality testing.

ailingzengzzz avatar Jun 27 '23 04:06 ailingzengzzz

Hi @ailingzengzzz So have you figured out why the results of VIBE are inconsistent with the paper? I suspect that there is something wrong with the data in the folder data/poses/pw3d_vibe_smpl.

HospitableHost avatar Jun 28 '23 03:06 HospitableHost

Besides, I run the evalution of this command : CUDA_VISIBLE_DEVICES=7 python eval_smoothnet.py --cfg configs/pw3d_spin_3D.yaml --checkpoint data/checkpoints/pw3d_spin_3D/checkpoint_32.pth.tar --dataset_name pw3d --estimator spin --body_representation smpl --slide_window_size 32

The result is also different from your paper. image image

HospitableHost avatar Jun 28 '23 06:06 HospitableHost

Hi @HospitableHost ,

The results in Table 3 are calculated via 3D keypoint positions following previous works (e.g., VIBE, PARE, HMR etc). They use a model to estimate SMPL parameters and transform them into 3D keypoint positions for MPJPE, PA-MPJPE, and ACCEL. We simply input the transformed 3D keypoint into SmoothNet to obtain the final 3D keypoint positions.

ailingzengzzz avatar Sep 28 '23 14:09 ailingzengzzz