
Feature request - Support WhisperSpeech for voice generation with whisper model

Open LSXAxeller opened this issue 10 months ago • 1 comment

WhisperSpeech is a text-to-speech, voice generation, and voice cloning model built by inverting OpenAI's Whisper model. Supporting it in whisper.cpp would let the project cover all the primary voice operations. Can it be integrated? WhisperSpeech Repo

LSXAxeller avatar Apr 23 '24 17:04 LSXAxeller

Thank you for mentioning:

  • https://github.com/collabora/WhisperSpeech

Seems interesting. But I'm not sure if Georgi wouldn't prefer to have it as a separate project, e.g. whisperspeech.cpp. :) Having both ASR and TTS in one project would be cool for sure, but it could make the maintenance harder, and maybe some tweaked version of whisper is needed there (don't know, as I haven't investigated it).

whisperspeech.cpp could be a nice alternative to:

  • https://github.com/PABannier/bark.cpp

przemoc avatar Apr 28 '24 19:04 przemoc

Looks like an interesting project, but I probably won't get around to integrating it anytime soon.

ggerganov avatar May 13 '24 11:05 ggerganov