BasicSR icon indicating copy to clipboard operation
BasicSR copied to clipboard

Multiples of 96 / 384 ESRGAN

Open davidvfx07 opened this issue 3 years ago • 1 comments

So I am using BasicSR to upscale the output of Wav2Lip to a more usable size. By training my own model every use gives me accurate teeth and eyes and really helps with the believability. Wav2Lip outputs 96x96, x4 is 384x384. I cannot seem to find a way to train ESRGAN with VGGStyleDiscriminator for 384x384.

Any help appreciated!

davidvfx07 avatar Jan 28 '22 15:01 davidvfx07

So I am using BasicSR to upscale the output of Wav2Lip to a more usable size. By training my own model every use gives me accurate teeth and eyes and really helps with the believability. Wav2Lip outputs 96x96, x4 is 384x384. I cannot seem to find a way to train ESRGAN with VGGStyleDiscriminator for 384x384.

Any help appreciated!

Hello, I trained real-esrgan in a 10 minute single person video, and the effect was that my mouth was kept closed and couldn't be opened. How can I solve this problem?

einsqing avatar Oct 08 '23 02:10 einsqing