Clipping off or skipping the first word on a chapter
Kokoro has been such an improvement!
I am having an issue where the first word of a recording is clipped off, for example "In it's landmark message..." will be read as "its landmark message..."
This happens, not always, but fairly often. Any idea why or a solution?
Thanks, this has been so useful. I am "reading" so much more XD
I've noticed that too, I think it depends on the voice.
Please give https://github.com/aedocw/epub2tts-kokoro a try if you have time. It might behave differently (that's the only one I've used in quite a while). Also try different voices and see if it does the same thing.
I observe the issue with kokoro (am_michael and others) also depending on whether "skip-parts" feature flag is also set. I can add a few things:
- Depending on the voice, some only "clip" on consonants. Some clip on vowels.
- The before-and-after nearby phoneme elements can cause this
- I believe the solution is to adjust up by 0.5s the silence defaults at the edges, and provide the user the ability to adjust this silence, as it's a core narrative element, stylistically.
Either way, getting more isolation between waveform synthesis steps should solve this clipping issue (and several others.)