dc_tts
dc_tts copied to clipboard
Vocabulary ( ) ! ; , - and eval sentences
The LJSpeech Dataset contains at least the additional characters ( ) ! ; , - Comma and dash are included in the original paper. Are these characters intentionally omitted?
Also i think the eval sentences give very little insight about the capabilities of the network. With this set we don't know if
- Questions are pronounced correctly
- Pauses after comma are reasonable
- Context based pronounciation is correct (e.g from tacotron2 sample "He has read the whole thing. ")