Real-time inference

Open a-ghorbani opened this issue 10 months ago • 11 comments

Is SyncTalk suitable for real-time inference? Are there any stats about latency and performance? Are there any benchmarks or optimization tips for real-time use?

a-ghorbani avatar Apr 08 '24 05:04 a-ghorbani

Good question!

oguzhandoganoglu avatar May 09 '24 09:05 oguzhandoganoglu

It definitely can. On my RTX 3090 I get ~50 fps when everything is loaded onto the GPU. Here is my example of real-time chatting using SyncTalk:

https://github.com/ZiqiaoPeng/SyncTalk/assets/12045814/623e5309-7d71-4fab-ad41-2280566b422a
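For anyone who wants to check a figure like this on their own hardware, here is a minimal sketch of measuring per-frame latency and throughput. It is not SyncTalk code: `benchmark_fps`, `frame_budget_ms`, and `dummy_render_frame` are hypothetical names, and the dummy function only stands in for whatever call produces one output frame in your setup. At 50 fps the budget is 20 ms per frame, so the mean latency would need to stay under that.

```python
import statistics
import time


def frame_budget_ms(fps):
    """Time available per frame, in milliseconds, at a target frame rate."""
    return 1000.0 / fps


def benchmark_fps(render_frame, n_frames=200, warmup=20):
    """Time a per-frame callable and report mean/p95 latency and fps.

    `render_frame` is a hypothetical zero-argument callable that produces
    one frame. If it launches GPU work through PyTorch, it should call
    torch.cuda.synchronize() before returning, because CUDA kernels run
    asynchronously and the timings would otherwise be too optimistic.
    """
    # Warm-up iterations are excluded so one-time setup cost is not counted.
    for _ in range(warmup):
        render_frame()

    latencies = []
    for _ in range(n_frames):
        start = time.perf_counter()
        render_frame()
        latencies.append(time.perf_counter() - start)

    mean = statistics.mean(latencies)
    latencies.sort()
    p95 = latencies[min(len(latencies) - 1, int(0.95 * len(latencies)))]
    return {
        "mean_ms": mean * 1e3,
        "p95_ms": p95 * 1e3,
        "fps": 1.0 / mean if mean > 0 else float("inf"),
    }


def dummy_render_frame():
    # Placeholder workload (assumption): replace with the real per-frame call.
    time.sleep(0.002)


if __name__ == "__main__":
    stats = benchmark_fps(dummy_render_frame, n_frames=50, warmup=5)
    print(stats)
    print("meets 50 fps budget:", stats["p95_ms"] <= frame_budget_ms(50))
```

Comparing the 95th-percentile latency against the budget, rather than the mean, gives a more conservative answer for live use, since occasional slow frames are what cause visible stutter.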

jinqiupeter avatar May 09 '24 15:05 jinqiupeter

How do you send a request after training the model? Can you share the code? Is it in the form of a test in the documentation?

oguzhandoganoglu avatar May 09 '24 15:05 oguzhandoganoglu

Lipsync is also very good:

https://github.com/ZiqiaoPeng/SyncTalk/assets/12045814/67bb8404-3f34-4363-b260-adcba5fefe3b

jinqiupeter avatar May 09 '24 15:05 jinqiupeter

The video doesn't work. Also, could you please answer my question if possible?

oguzhandoganoglu avatar May 09 '24 16:05 oguzhandoganoglu

How do you send a request after training the model? Can you share the code? Is it in the form of a test in the documentation? @jinqiupeter

oguzhandoganoglu avatar May 09 '24 23:05 oguzhandoganoglu

@jinqiupeter which audio encoder are you using - ave? hubert? deepspeech?

varunjain99 avatar Jun 11 '24 19:06 varunjain99

> @jinqiupeter which audio encoder are you using - ave? hubert? deepspeech?

both ave and hubert, they have very similar metrics for my video

jinqiupeter avatar Jun 12 '24 03:06 jinqiupeter

> both ave and hubert, they have very similar metrics for my video

@jinqiupeter Is your demo video using the GUI code path from the SyncTalk repo? I believe this is the code path that supports real-time streaming of audio, but I wasn't able to get it to work with ave.

Which code path does real-time ASR with ave or hubert?

varunjain99 avatar Jun 12 '24 22:06 varunjain99

@jinqiupeter Hello, nice to meet you. I'm working on real-time inference for SyncTalk, and I see you have solved this problem. Could you share a little information about how you did it? Thank you for your kindness.

nguyenchiminhhieu avatar Sep 14 '24 06:09 nguyenchiminhhieu

Has anyone figured out how to run it in real time? It would be a great help to know.

Deltaidiots avatar Oct 03 '24 09:10 Deltaidiots