Customizing voice / tone generation with specific input vocal

Open Tortoise17 opened this issue 2 years ago • 0 comments

Dear Friends,

Since, I am trying to understand the working of this tool. I want to ask audioldm2 -t "A female reporter is speaking full of emotion" --transcription "Wish you have a good day"

This is simple example which generates the audio of the female speaker which is random voice. Can we customize it to specific voice by giving input audio small file as input sequence? I have seen input sequence function. But still I could not understand how to adopt it.? Please guide.

Nov 28 '23 13:11 Tortoise17