Wav2Lip icon indicating copy to clipboard operation
Wav2Lip copied to clipboard

How to train SyncNet with frames of 3 and 7?

Open wtc9806 opened this issue 1 year ago • 0 comments

Hello, I want to train Syncnet with the number of image sequences at 3 and 7, but I don't know if my configuration is correct. In the case of 5 frames of image, the syncnet_T is 5 and the syncnet_mel_step_size is 16. One frame of image corresponds to 3.2 frames of audio. So, When the input image is 3 and 7 frames, the corresponding syncnet_mel_step_size is 9.6 and 22.4???

wtc9806 avatar Dec 18 '23 12:12 wtc9806