TTS-WebUI icon indicating copy to clipboard operation
TTS-WebUI copied to clipboard

SeamlessM4T Using audio files to implement translation !Can you support it?

Open curui opened this issue 1 year ago • 3 comments

QQ截图20240310024054 Translate other languages via audio files like this!

curui avatar Mar 09 '24 18:03 curui

Hi, please try the newest update, I added a basic implementation for testing. Not all of the languages in the list are supported, and I don't have the S2TT, T2TT and ASR yet, I just added the S2ST functionality.

https://github.com/rsxdalv/tts-generation-webui/pull/284 127 0 0 1_7860_ (1)

rsxdalv avatar Mar 10 '24 17:03 rsxdalv

您好,请尝试最新的更新,我添加了一个基本的测试实现。并非列表中的所有语言都受支持,而且我还没有 S2TT、T2TT 和 ASR,我只是添加了 S2ST 功能。

第284章 127 0 0 1_7860_ (1)

thank u!You can refer to this https://replicate.com/adirik/seamless-expressive. They can recognize speech and translate the same timbre, just like heygen. They seem to use seamless. For example: Upload the audio of an American who can translate and speak Chinese with the same accent/tone

curui avatar Mar 14 '24 20:03 curui

Interesting, the results on replicate seem better than what I saw myself so far. Thank you! This gives more research to do.

rsxdalv avatar Mar 14 '24 21:03 rsxdalv