open-tts-tracker
open-tts-tracker copied to clipboard
🗣️ Open TTS Tracker
A one stop shop to track all open-access/ source TTS models as they come out. Feel free to make a PR for all those that aren't linked here.
This is aimed as a resource to increase awareness for these models and to make it easier for researchers, developers, and enthusiasts to stay informed about the latest advancements in the field.
[!NOTE]
This repo will only track open source/access codebase TTS models. More motivation for everyone to open-source! 🤗
| Name | GitHub | Weights | License | Fine-tune | Languages | Paper | Demo | Issues |
|---|---|---|---|---|---|---|---|---|
| XTTS | Repo | 🤗 Hub | CPML | Yes | Multilingual | Technical notes | 🤗 Space | |
| TorToiSe TTS | Repo | 🤗 Hub | Apache 2.0 | Yes | English | Technical report | 🤗 Space | |
| VITS/ MMS-TTS | Repo | 🤗 Hub / MMS | Apache 2.0 | Yes | English | Paper | 🤗 Space | |
| Pheme | Repo | 🤗 Hub | CC-BY | Yes | English | Paper | 🤗 Space | |
| OpenVoice | Repo | 🤗 Hub | CC-BY-NC 4.0 | No | ZH + EN | Paper | 🤗 Space | |
| IMS-Toucan | Repo | GH release | Apache 2.0 | Yes | Multilingual | Paper | 🤗 Space | |
| Matcha-TTS | Repo | GDrive | MIT | Yes | English | Paper | 🤗 Space | GPL-licensed phonemizer |
| pflowTTS | Unofficial Repo | GDrive | MIT | Yes | English | Paper | Not Available | GPL-licensed phonemizer |
| StyleTTS 2 | Repo | 🤗 Hub | MIT | Yes | English | Paper | 🤗 Space | GPL-licensed phonemizer |
| VALL-E | Unofficial Repo | Not Available | MIT | Yes | NA | Paper | Not Available | |
| HierSpeech++ | Repo | GDrive | CC-BY-NC-SA 4.0 | No | KR + EN | Paper | 🤗 Space | |
| Bark | Repo | 🤗 Hub | MIT | No | Multilingual | Paper | 🤗 Space | |
| EmotiVoice | Repo | GDrive | Apache 2.0 | Yes | ZH + EN | Not Available | Not Available | Separate GUI agreement |
| Amphion | Repo | 🤗 Hub | MIT | No | Multilingual | Paper | 🤗 Space | |
| xVASynth | Repo | GH commit | GPL-3.0 | Yes | Multilingual | Paper | Not Available | Copyright materials used for training. |
| OverFlow TTS | Repo | GitHub | MIT | Yes | English | Paper | GH Pages | |
| Neural-HMM TTS | Repo | GitHub | MIT | Yes | English | Paper | GH Pages | |
| Tacotron 2 | Unofficial Repo | GDrive | BSD-3 | Yes | English | Paper | Webpage | |
| Glow-TTS | Repo | GDrive | MIT | Yes | English | Paper | GH Pages | |
| Silero | Repo | GH links | CC BY-NC-SA | No | EM + DE + ES + EA | Not Available | Not Available | Non Commercial |
| MahaTTS | Repo | 🤗 Hub | Apache 2.0 | No | English, Hindi, Indian English, Bengali, Tamil, Telugu, Punjabi, Marathi, Gujarati, Assamese | Not Available | Recordings, Colab |
How can you help?
Help make this list more complete. Create demos on the Hugging Face Hub and link them here :) Got any questions? Drop me a DM on Twitter @reach_vb.