Lip2Wav icon indicating copy to clipboard operation
Lip2Wav copied to clipboard

during preprocess how to save frames without faces?

Open dongdongdashen opened this issue 3 years ago • 2 comments

Hi,this is a great job. I try to use my own dataset to reconstruct the speech.The dataset are videos including medical images of vocal organs without human faces.Can you tell me how to save these frames without faces? Thanks a lot!

dongdongdashen avatar Nov 26 '21 05:11 dongdongdashen

Hi, you would need a different preprocessing script, where you specify in each frame which part of the image to save as a "crop". In our evaluation script, for example, we save the face region given by the face detector as the crop for that frame.

prajwalkr avatar Dec 01 '21 15:12 prajwalkr

Thanks for your reply!Now I can get the medical images list but it seems can't run in training. I am now looking for the cause. I notice the chem images are about 120 x 180 and mine are 580 x 360.Do I need to adjust the size of my images before training?Besides,my videos are 60 fps not 30 fps (3~5 seconds each),do I need to modify related parameters to match my data?

dongdongdashen avatar Dec 03 '21 09:12 dongdongdashen