text-to-speech-api
text-to-speech-api copied to clipboard
Can I get word level timestamp of the narration?
After the article (text) is converted to audio, can I get word level timestamp of the speech as part of the metadata? This feature is extremely important for our use case. Thank you