
timings of phonemes as it gets streamed raw

Open TechHackie opened this issue 9 months ago • 1 comments

It would be very helpful if I could somehow get the timings of the phoneme frames as they are produced. I can't seem to find any way to do this, and other TTS implementations have this feature. Any help is welcome.

Thanks

TechHackie, Mar 19 '25 13:03

I'm looking for the same thing, as I'd like to use it to animate an avatar. Perhaps the only way to do this is to run the WAV output through some other analysis software that detects phonemes (or at least the major ones, like the vowels), but I have not yet found a good tool for that. It would of course be fantastic if this could be done directly by Piper.
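For the animation use case, what I'd need in the end is a list of (phoneme, start, end) entries. Here is a minimal sketch of that conversion, assuming per-phoneme frame counts could somehow be obtained from the model. That is hypothetical: nothing in this thread confirms Piper exposes them. The function name, the `PhonemeTiming` type, and the default hop length and sample rate are also my assumptions and would need to match the actual voice.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class PhonemeTiming:
    phoneme: str
    start_s: float  # start time in seconds
    end_s: float    # end time in seconds


def frames_to_timings(
    phonemes: Sequence[str],
    frame_counts: Sequence[int],
    hop_length: int = 256,     # assumed samples per frame; check the voice
    sample_rate: int = 22050,  # assumed output rate in Hz; check the voice
) -> List[PhonemeTiming]:
    """Turn hypothetical per-phoneme frame counts into start/end times."""
    if len(phonemes) != len(frame_counts):
        raise ValueError("phonemes and frame_counts must have the same length")

    timings = []
    cursor = 0  # running total of frames emitted so far
    for phoneme, count in zip(phonemes, frame_counts):
        start_s = cursor * hop_length / sample_rate
        cursor += count
        end_s = cursor * hop_length / sample_rate
        timings.append(PhonemeTiming(phoneme, start_s, end_s))
    return timings
```

Because the cursor is a running frame total, the same loop would also work chunk by chunk on streamed output, as long as the total is carried over between chunks.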

64jcl, Apr 29 '25 11:04