MuseTalk
MuseTalk copied to clipboard
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
提供推理的音频,中间和结尾有一段没有声音的,但是嘴巴一直在动,请问可以怎么优化吗?

很好的工作内容,对于图片作为输入的话。画质肉眼较好。但当视频作为输入的时候,画质会损失的比较严重。该如何解决。 
你好,在我们的的部署中发现一个问题,即在VAE的模型中,将GPU上的数据拷贝到CPU上花费了巨量的时间。简单来说就是在不考虑这一步的情况下,实时性可以达到60+的fps。但是因为它的存在导致我们的性能只能在30fps左右。请问有没有什么办法在这个基础上做到优化呢?这是因为显卡位宽所导致的吗?我们的实验环境是4090。
Thank your great work! I conducted some experiments and found that the fidelity of the generated faces is poor, generated person does not resemble the original video closely. Could you...
ImportError: cannot import name 'ForkProcess' from 'multiprocessing.context'
近景时候嘴部会变得奇怪且很模糊,中景时候就挺好的, 希望修复下,感谢作者
Hi, can anyone help me resolve this issue? Thank you! I encountered the error below when running `app.py`, specifically `imageio.mimwrite(output_video, images, 'FFMPEG', fps=fps, codec='libx264', pixelformat='yuv420p')` Error message: ``` File "/usr/local/lib/python3.10/dist-packages/imageio_ffmpeg/_io.py",...
ImportError: /home/ubuntu/miniconda3/envs/myenv/lib/python3.10/site-packages/mmcv/_ext.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops10zeros_like4callERKNS_6TensorEN3c108optionalINS5_10ScalarTypeEEENS6_INS5_6LayoutEEENS6_INS5_6DeviceEEENS6_IbEENS6_INS5_12MemoryFormatEEE