wav2lip-hq icon indicating copy to clipboard operation
wav2lip-hq copied to clipboard

Input shapes to the VGG discriminator are wrong

Open shehrum opened this issue 2 years ago • 3 comments

When running the model on 384 * 384 size imaged, I get the following error from VGG discriminator in the ESRGAN

assert x.size(2) == 128 and x.size(3) == 128, (f'Input spatial size must be 128x128, ' AssertionError: Input spatial size must be 128x128, but received torch.Size([4, 3, 384, 384]).

shehrum avatar Jun 09 '22 13:06 shehrum

me too.I have the same problem. Do you know how to solve it?

Chesuour avatar Jun 29 '22 02:06 Chesuour

same problem

yo-WASSUP avatar Jun 30 '22 06:06 yo-WASSUP

Same here

AIMads avatar Aug 20 '22 13:08 AIMads