echomimic issues

为什么用绿色背景推理出来得视频带有光晕

原图: ![image](https://github.com/user-attachments/assets/30026a68-6ea5-4c0c-bd0c-d8f654f575d1) 推理后得效果图: ![image](https://github.com/user-attachments/assets/5f06ee31-b5b3-433a-b4ab-e12cf2d44e6d)

llybood

执行报错:The size of tensor a (64) must match the size of tensor b (128) at non-singleton dimension 4

6

The size of tensor a (64) must match the size of tensor b (128) at non-singleton dimension 4 我上传的文件是png格式.500K左右音频是flac格式 1.8M左右执行过程: 启动服务:python3 -u webgui.py --server_port=3000 通过页面上传图片和音频图片: ![img-jIpz5hAkPxsl0TKWGuzffkWz](https://github.com/user-attachments/assets/2bbd1075-af5d-4f28-a408-a71244b797b5) 信息:...

yeohx

Slow speed on V100

1

Still need 7 mins to run test code 'python -u infer_audio2vid.py'. How to accelerate it? Thanks! ```cmd root@dsw-448852-67578dcfc6-fh9sp:/mnt/workspace/EchoMimic# python -u infer_audio2vid.py /usr/local/lib/python3.10/site-packages/diffusers/utils/outputs.py:63: FutureWarning: `torch.utils._pytree._register_pytree_node` is deprecated. Please use `torch.utils._pytree.register_pytree_node` instead....

CurrenWong

为什么face_locator_tensor 这么慢呀。。。69s一步

6

To create a public link, set `share=True` in `launch()`. video in 24 FPS, audio idx in 50FPS whisper_chunks: (361, 50, 384) audio_fea_final: torch.Size([1, 361, 50, 384]) ref_image_latents shape: torch.Size([1, 4,...

3682483

quality drop with non 512x512 width and height (非512x512大小的输出质量变差）

3

If I modified -W and -H to non-512x512, such as (384,384), (1024, 1024), (256, 256), the lip motion is damaged in different degrees. The most severe setting is under 1024x1024,...

RockySong

Heavy Head Movement and bad Mouth/Teeth ?

2

hey there again, what slider i need to change, when my renderd video has a **heavy head movement?** i only need some movements above the neck and not floating around...

lost-in-emotions

Workaround for the Setting Parameters ?

hey there, is anywhere a tutorial or a reference how to use the sliders correctly? only -maybe - find something out without knowing what i do, is really frustrating. thats...

lost-in-emotions

Fix the bug in the inference

1

Since I saw that the background was noisy in the previous generation, and I guessed that it could be due to the effect of moore's codebase, I guessed that you...

Guohanzhong

运行所有指令都出现killed

3

运行所有指令都出现killed，是因为配置不够吗？ docker环境内存：16g cpu：12400f gpu：4070s 运行python -u webgui.py --server_port=3000出现killed，没有报错

timfengzi

Validation details

I noticed that your validation metric for [hallo](https://github.com/fudan-generative-vision/hallo) on the HDTF dataset is 501, while the original metric is 173. I would like to understand the specific details of the...

zhang-haojie

echomimic
echomimic copied to clipboard

Metadata

为什么用绿色背景推理出来得视频带有光晕

执行报错:The size of tensor a (64) must match the size of tensor b (128) at non-singleton dimension 4

Slow speed on V100

为什么face_locator_tensor 这么慢呀。。。69s一步

quality drop with non 512x512 width and height (非512x512大小的输出质量变差）

Heavy Head Movement and bad Mouth/Teeth ?

Workaround for the Setting Parameters ?

Fix the bug in the inference

运行所有指令都出现killed

Validation details

← Metadata

Owner

Metadata

echomimic echomimic copied to clipboard

Metadata

← Metadata

Owner

Metadata

echomimic
echomimic copied to clipboard