parler-tts
parler-tts copied to clipboard
Streaming support?
Is there any streaming support for this model? if there is a way to do it i would love to get involved and help out!
Hey @jay2jp, thanks for opening the discussion!
Streaming could be done by adapting the Streamer used for Musicgen with the current code! It'd be great if you could be involved in this! What do you think?
Let me know if you need help! Best
Hey @ylacombe,
I would be more than happy to to adopt the module to the current parler-tts, i did a research and found out @sanchit-gandhi already created a space for it on huggingface and the Streamer module so i don't know if there is still need for it to be added.
But to make the module more advanced i wanted to know, is there any way we get to know the duration of each word getting streamed? and their start time ( offset ) in the full audio? this could help a lot of developers to have more control over the streamed audio, for example it can be used for lipsyncs, or real time function control and .....
So if it's possible in any way, i would be more than happy to extend the streaming module of parler ( indeed with a little bit of your kind help 😅). Best regards
I want to do something like this so that it can read text incrementally (epub reader). However, I'm way out of my depth, I thought this would be a nice little idea to get started with AI but I don't know much and not even the search terms to get started. I'm sorry if I come across as too ignorant or lazy when asking this. I truly don't know where to start, so any help even a let me google would be awsome.
Hi @ylacombe ,I am useing parler tts-miniv1,i am faceing one issue ,For large amount of text ,the voice is not good and clear .I am sharing my code snippet