RealtimeTTS icon indicating copy to clipboard operation
RealtimeTTS copied to clipboard

Add F5 TTS

Open pabloortegaa opened this issue 1 year ago • 8 comments

I just saw this TTS model(https://github.com/SWivid/F5-TTS), which works very well for English. Are you planning on including it on the project? Thanks!!

pabloortegaa avatar Oct 22 '24 09:10 pabloortegaa

Absolutely thinking about that, I love that TTS system. Things that keep me off currently:

  1. Not on pypi - without pip install I need to copy their whole repo, don't want that
  2. No streaming support so bigger latency than for other engines Both are kind of bummers but 1 is the bigger one. I need a solution to gracefully install it with RealtimeTTS

KoljaB avatar Oct 22 '24 10:10 KoljaB

Hi! @KoljaB, would pip install git+https://github.com/SWivid/F5-TTS.git not suffice as a pip install?

aadr1024 avatar Nov 11 '24 12:11 aadr1024

Sadly not, pypi would not accept this within a setup file. So I can't integrate it to be installed with "pip install RealtimeTTS[F5]" currently. Also no streaming support is still a bummer. Still my first pick TTS system that I would love to integrate (together with GPT-SoVITS).

KoljaB avatar Nov 11 '24 14:11 KoljaB

Looked into F5-TTS code.

Implementing real-time streaming is not trivial because the model processes entire sequences at once using full-sequence attention and ODE integration. This doesn't support incremental output. So it would require significant architectural changes to support incremental, real-time computation needed for streaming.

KoljaB avatar Nov 11 '24 18:11 KoljaB

I have a working version of F5-TTS that streams and uses a apache licensed model trained on English, F5-TTS now has pip install f5-tts, perhaps this will help someone out. All info here - https://drive.google.com/drive/folders/1LmAkK08YFWz57e8EF8ZgovqspKFk56xQ?usp=sharing

MercyfulKing avatar May 16 '25 06:05 MercyfulKing

Wow, nice. I'll definitely look into that soon, thanks a lot.

KoljaB avatar May 16 '25 08:05 KoljaB

Hi all, is there any update on adding the F5 TTS?

BWJ2310 avatar Jul 10 '25 19:07 BWJ2310

Looked into it again some weeks ago. It's not ready for real-time implementations.

KoljaB avatar Jul 10 '25 20:07 KoljaB