tortoise-tts
tortoise-tts copied to clipboard
Multiple speakers defined in input text
Is it possible to define multiple speakers for different portions of the input text that you feed to read.py?
Maybe via SSML syntax or, but I'm dreaming here, with natural language inside brackets (e.g., [Tom speaks:])?