
Can we generate emotional speech with tags for pauses, silence, laughing, etc.?

Open dharmesh8b opened this issue 4 months ago • 3 comments

dharmesh8b avatar Sep 02 '25 10:09 dharmesh8b

Explicit speech tagging is not supported at the moment.

YaoyaoChang avatar Sep 02 '25 10:09 YaoyaoChang

Here is an interesting way to control speech and emotion through the voice prompt; details can be found in HF discussion12.

YaoyaoChang avatar Sep 03 '25 08:09 YaoyaoChang

Thanks @YaoyaoChang for sharing the great idea.

To sum it up:

  • Load the same voice into all 4 speaker "slots", but give each slot a different voice clip of that voice expressing a different emotion (shouting, etc.).
  • Then write your script with the "Speaker 0: Hello! Speaker 1: What's up!" syntax and switch speakers to trigger the different emotions. With 4 slots, you can mark up to 4 different emotions this way.
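
A minimal sketch of the script-writing step, in plain Python with no VibeVoice calls. The emotion labels, the `EMOTION_SLOTS` mapping, and the `build_script` helper are all hypothetical names made up for illustration; the one-turn-per-line layout and the 0-based slot numbers are assumptions, so check them against the script format your VibeVoice setup expects.

```python
from typing import Iterable, Tuple

# Hypothetical mapping from an emotion label to one of the 4 speaker slots.
# Each slot is assumed to be loaded with a clip of the SAME voice recorded
# in that emotion; the labels here are illustrative only.
EMOTION_SLOTS = {
    "neutral": 0,
    "shouting": 1,
    "whispering": 2,
    "laughing": 3,
}


def build_script(turns: Iterable[Tuple[str, str]]) -> str:
    """Turn (emotion, text) pairs into "Speaker N: text" lines."""
    lines = []
    for emotion, text in turns:
        if emotion not in EMOTION_SLOTS:
            raise ValueError(f"no speaker slot assigned to emotion {emotion!r}")
        lines.append(f"Speaker {EMOTION_SLOTS[emotion]}: {text}")
    # Assumption: one speaker turn per line.
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_script([("neutral", "Hello!"), ("shouting", "What's up!")]))
```

The resulting text would then be passed to the model as the script, with the four emotion clips supplied as the voice prompts for the matching slots.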

Arcitec avatar Sep 07 '25 12:09 Arcitec