Wav2Lip

The output video has a clear box around the mouth region.

Open rohaantahir opened this issue 2 years ago • 8 comments

Hi, I have tried the library, and it's exceptional work. I am running inference but facing an issue with the output: the generated video has a boxy artifact around the mouth. Is there any way to remove it?

rohaantahir, Aug 22 '22 10:08

It seems that setting the resize_factor argument to produce a lower-resolution video can make the box less obvious, but it also blurs the whole output video.

YoungXu06, Aug 30 '22 06:08
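
One possible reading of why a lower resolution hides the box: the thread mentions that the model works at an img_size of 96, so the generated patch has to be stretched to cover the face crop in the frame, and a smaller frame means less stretching. The sketch below only does that arithmetic. The 480-pixel crop width and the helper name `patch_upscale_ratio` are assumptions for illustration, not values or functions from the repository.

```python
def patch_upscale_ratio(face_crop_px: int, resize_factor: int = 1,
                        model_size: int = 96) -> float:
    """Return how much a model_size-wide patch is stretched to fill the face crop."""
    if resize_factor < 1:
        raise ValueError("resize_factor must be >= 1")
    # Downscaling the frame by resize_factor shrinks the face crop by the same factor.
    resized_crop_px = face_crop_px // resize_factor
    return resized_crop_px / model_size


# Hypothetical 480-pixel-wide face crop at several resize factors.
for factor in (1, 2, 4):
    print(factor, patch_upscale_ratio(480, factor))
```

Under these assumptions the stretch drops from 5x to 2.5x to 1.25x, which would explain both effects reported above: the patch blends in better, but the whole frame loses detail.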

Hi @rohaantahir ,

I ran into the same problem. Have you solved it?

liuquande, Sep 19 '22 03:09

@liuquande No, the issue is with their model. It is trained at low resolution, which is why the bounding box appears.

rohaantahir, Sep 19 '22 09:09

You can change the network topology to work at 256x256 or 512x512 resolution and train it from scratch.

NikitaKononov, Nov 05 '22 12:11

Do you mean changing img_size from 96 to 256 or 512?

wllps1988315, Jan 11 '23 04:01

> change img_size from 96 to 256 or 512?

That just causes errors: the model expects 96, so changing it to 256 throws the architecture off.

I'm really looking for a fix, as the output video quality is pretty bad with that blurry rectangle over the modified portion of the image.

1800dinky, Nov 07 '23 05:11
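
A rough way to see why changing img_size alone fails is to trace feature-map sizes through a fixed stack of convolutions. The layer list below is an assumption, loosely modeled on an encoder designed for 96x96 input, and is not copied from the repository. Only the standard convolution output-size formula is certain.

```python
def conv_out(size: int, kernel: int, stride: int, padding: int) -> int:
    """Standard output-size formula for a 2D convolution along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1


# Hypothetical encoder as (kernel, stride, padding) tuples: one 7x7 stem,
# five stride-2 downsampling stages, then an unpadded 3x3 and a 1x1 layer.
ASSUMED_ENCODER = [(7, 1, 3)] + [(3, 2, 1)] * 5 + [(3, 1, 0), (1, 1, 0)]


def trace_sizes(input_size: int, layers=ASSUMED_ENCODER) -> list:
    """Return the spatial size of the input and of each layer's output."""
    sizes = [input_size]
    for kernel, stride, padding in layers:
        sizes.append(conv_out(sizes[-1], kernel, stride, padding))
    return sizes


print(trace_sizes(96))   # bottleneck collapses to 1x1
print(trace_sizes(256))  # bottleneck stays at 6x6
```

With this assumed stack, a 96-pixel input ends in a 1x1 bottleneck while a 256-pixel input ends at 6x6, so any later layer that expects a 1x1 map hits a shape mismatch. This is consistent with the earlier advice that a higher resolution needs a changed topology (for example, more downsampling stages) and training from scratch.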

Would fine-tuning the model resolve this issue? If so, how would I do that?

Oyiyi, Nov 20 '23 04:11

@1800dinky I'm pretty sure they do it on purpose

FreeTymeKiyan, Jan 20 '24 04:01