echomimic
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Original image:  Result image after inference: 
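The size of tensor a (64) must match the size of tensor b (128) at non-singleton dimension 4. The file I uploaded is a PNG of about 500 KB; the audio is a FLAC file of about 1.8 MB. Steps: start the service with python3 -u webgui.py --server_port=3000, then upload the image and audio through the page. Image:  Info:...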
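One possible reading of the 64 vs. 128 mismatch, offered as a sketch and not as a confirmed diagnosis: if the pipeline uses a Stable Diffusion style VAE that downsamples each side by 8 (an assumption, not stated in the thread), a 512-pixel side maps to 64 latent units and a 1024-pixel side maps to 128. The helper names below are hypothetical and are not part of EchoMimic.

```python
# Assumption: the VAE downsamples each spatial side by a factor of 8,
# as in Stable Diffusion style pipelines. Not confirmed by the thread.
VAE_SCALE = 8


def latent_hw(height: int, width: int, scale: int = VAE_SCALE) -> tuple[int, int]:
    """Return the latent (height, width) for a pixel-space image size."""
    if height % scale or width % scale:
        raise ValueError(f"{height}x{width} is not divisible by {scale}")
    return height // scale, width // scale


def shapes_compatible(ref_hw: tuple[int, int], target_hw: tuple[int, int]) -> bool:
    """True when two pixel-space sizes produce identical latent sizes."""
    return latent_hw(*ref_hw) == latent_hw(*target_hw)


if __name__ == "__main__":
    print(latent_hw(512, 512))      # (64, 64)
    print(latent_hw(1024, 1024))    # (128, 128)
    print(shapes_compatible((1024, 1024), (512, 512)))  # False
```

Under that assumption, a reference image whose side is 1024 pixels would not line up with a 512-pixel generation size, which would match the numbers in the error message.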
It still takes 7 minutes to run the test code `python -u infer_audio2vid.py`. How can I accelerate it? Thanks!

```cmd
root@dsw-448852-67578dcfc6-fh9sp:/mnt/workspace/EchoMimic# python -u infer_audio2vid.py
/usr/local/lib/python3.10/site-packages/diffusers/utils/outputs.py:63: FutureWarning: `torch.utils._pytree._register_pytree_node` is deprecated. Please use `torch.utils._pytree.register_pytree_node` instead....
```
To create a public link, set `share=True` in `launch()`. video in 24 FPS, audio idx in 50FPS whisper_chunks: (361, 50, 384) audio_fea_final: torch.Size([1, 361, 50, 384]) ref_image_latents shape: torch.Size([1, 4,...
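As a rough aid for reading the log above, the sketch below converts a frame count into a clip length. It assumes that the first dimension of `whisper_chunks` (361) is the number of video frames and that the video runs at the logged 24 FPS; the log does not state the first point explicitly. The function name is hypothetical.

```python
def clip_seconds(num_frames: int, fps: float = 24.0) -> float:
    """Length in seconds of a clip with num_frames frames at the given FPS."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return num_frames / fps


if __name__ == "__main__":
    # 361 is taken from the whisper_chunks shape in the log (assumed to be frames).
    print(round(clip_seconds(361), 2))  # 15.04
```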
If I set -W and -H to anything other than 512x512, such as (384, 384), (1024, 1024), or (256, 256), the lip motion is degraded to varying degrees. The worst case is 1024x1024,...
Hey there again, which slider do I need to change when my rendered video has **heavy head movement**? I only need some movement above the neck, not floating around...
Hey there, is there a tutorial or reference anywhere on how to use the sliders correctly? Only maybe finding something out without knowing what I am doing is really frustrating. That's...
Since I saw that the background was noisy in the previous generation, and I guessed that it could be due to the effect of moore's codebase, I guessed that you...
Every command I run ends with "Killed". Is my hardware insufficient? Docker environment. RAM: 16 GB, CPU: 12400F, GPU: 4070S. Running python -u webgui.py --server_port=3000 prints "Killed" with no error message.
I noticed that your validation metric for [hallo](https://github.com/fudan-generative-vision/hallo) on the HDTF dataset is 501, while the original metric is 173. I would like to understand the specific details of the...