video-preprocessing Why not just crop the faces with their meta data from VoxCeleb since we already have the face bboxes?

Open Cold-Winter opened this issue 4 years ago • 3 comments

Thank you for the elegant implementation. It helps a lot!

I am wondering why you need to detect the faces in the VoxCeleb dataset, since the dataset already provides face bounding box metadata. Are you trying to crop tighter face bboxes instead of using theirs? What if we train the first order model on faces cropped with their boxes?

Apr 15 '21 16:04 Cold-Winter

Any update on this?

Aug 05 '21 18:08 charan223

Same question

Aug 24 '22 18:08 brianw0924

Same question. Besides, the provided bounding box does not seem to be square. For example, one bounding box has a size of (1018 - 648, 553 - 48), i.e., (370, 505). However, this code directly resizes the rectangular crop to a square image, as in here.
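
To make the aspect-ratio concern concrete: resizing a 370 x 505 crop straight to a square stretches it horizontally by roughly 1.36x (505 / 370). One common workaround, which is not necessarily what this repo does, is to expand the shorter side around the box center before resizing. A minimal sketch, assuming the numbers above are (left, top, right, bottom) = (648, 48, 1018, 553); `square_box` is a hypothetical helper name, not part of the repo:

```python
def square_box(left, top, right, bottom):
    """Expand the shorter side of a box around its center so the box is square.

    Assumes pixel coordinates given as (left, top, right, bottom).
    Hypothetical helper for illustration only.
    """
    width = right - left
    height = bottom - top
    side = max(width, height)  # the longer side decides the square size

    # Keep the center of the original box fixed.
    center_x = (left + right) / 2
    center_y = (top + bottom) / 2
    half = side / 2

    return (center_x - half, center_y - half, center_x + half, center_y + half)


# The example box from this comment: 370 wide, 505 tall.
box = (648, 48, 1018, 553)
squared = square_box(*box)
print(squared)  # (580.5, 48.0, 1085.5, 553.0)
```

The squared box can extend past the frame border, so a real pipeline would still need to clamp or pad it before cropping.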

Feb 21 '23 07:02 HowieMa
