WhisperSpeech
WhisperSpeech copied to clipboard
Is the project still alive?
Is the project still alive?
It seems there is no movement and no new languages here, and some new things are happening in the TTS realm.
It could be good to know if the project will receive new updates or not, not criticism, just asking :)
Thanks!
I've been delegated some limited authority to main basic things, make basic changes, so if you'll outline what things are new and exciting, and if I'm competent in that area, I might be able to help.
@BBC-Esq is there any official plan to add support for more languages? Or is there a training guide or collab script?
This is a novel innovation and effectively competes with / replaces the need for Meta's Seamless M4T, which does not have the most permissive license. So it is a worthy pursuit. It is only lacking in language coverage.
I would love to see an organized effort to begin moving this project in the direction of more language support - either 1st party or through community LORAs etc.
@BBC-Esq Nvidia has recently released a giant multi language dataset.
https://blogs.nvidia.com/blog/speech-ai-dataset-models/
https://huggingface.co/datasets/nvidia/Granary
Unortunately, I am just a contributor and don't have access to the compute power that the founders of this repository used when supporting other languages… so I can modify the current code base, but as far as coming out with that support additional languages, I'm not able to do that unfortunately