Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection icon indicating copy to clipboard operation
Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection copied to clipboard

About datapropress

Open LonelyPlanetIoT opened this issue 2 years ago • 5 comments

I have a confusing about dataprocess. I find that you resize the frame in face_detector.py in line68 when you create VideoDatasset. Will it decrease the final result? And the resize operator it is neccessory or not if I want to use dataprocess like this for other model? image

LonelyPlanetIoT avatar Apr 25 '23 13:04 LonelyPlanetIoT

This part has been hugely inspired by Selim Seferbekov work: https://github.com/selimsef/dfdc_deepfake_challenge/blob/master/preprocessing/face_detector.py We maintained the frame resize to be totally comparable, anyway, we think that it would not affect the result. Maybe you can try to remove it and see what happens.

davide-coccomini avatar Apr 25 '23 14:04 davide-coccomini

Thanks a lot

LonelyPlanetIoT avatar Apr 25 '23 14:04 LonelyPlanetIoT

I try to cancel the resize operator and a error come like the follow picture. Maybe if you don't resize the computer doesn't have enough resource to execute the code? image

LonelyPlanetIoT avatar Apr 25 '23 15:04 LonelyPlanetIoT

Can you provide the full stack trace?

davide-coccomini avatar Apr 25 '23 16:04 davide-coccomini

Sorry for replying so late. In the end ,I didn't change it. But I have another question. I use MTCNN to extract face.The thresholds is set as same as yours. In this situation, I find some videos such as NT/808_829.mp4 can not extract enough faces as the paper mentioned. And I try to decrease the thresholds. But some frames are not face. What should I do?

LonelyPlanetIoT avatar May 19 '23 16:05 LonelyPlanetIoT