
Verify alphabet in pb* and tflite models

Open JRMeyer opened this issue 3 years ago • 2 comments

The alphabet files shipped with the Jaco models are inconsistent with what the models output at runtime. For example, the Jaco Spanish model can produce accented vowels, but its alphabet file does not include them. The correct alphabet for each model should be confirmed and uploaded to the zoo so it can be used for language model generation.
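A rough way to spot this kind of mismatch is to compare the characters in an alphabet file against the characters seen in the model's transcripts. The sketch below is only an illustration, not part of STT: `load_alphabet` and `missing_characters` are hypothetical helper names, the sample data is made up, and it assumes the usual alphabet.txt layout (one character per line, `#` starting a comment line, `\#` standing for a literal hash).

```python
import unicodedata


def load_alphabet(lines):
    """Parse alphabet.txt-style lines into a set of characters.

    Assumed format: one label per line, '#' starts a comment line,
    and the two-character line '\\#' stands for a literal '#'.
    """
    chars = set()
    for line in lines:
        line = line.rstrip("\r\n")  # keep a lone space, which is a valid label
        if line == "\\#":
            chars.add("#")
        elif line.startswith("#"):
            continue  # comment line
        elif line:
            chars.add(unicodedata.normalize("NFC", line))
    return chars


def missing_characters(alphabet, transcripts):
    """Return the sorted characters that occur in transcripts but not in alphabet."""
    seen = set()
    for text in transcripts:
        # Normalize so that a precomposed accented vowel and its
        # base-letter-plus-combining-accent form compare equal.
        seen.update(unicodedata.normalize("NFC", text))
    return sorted(seen - alphabet)


# Made-up sample data: an alphabet without accented vowels,
# and transcripts as a model might emit them at runtime.
ALPHABET_LINES = ["# hypothetical alphabet\n", " \n", "a\n", "e\n",
                  "n\n", "o\n", "s\n", "t\n", "\\#\n"]
TRANSCRIPTS = ["está", "son"]

missing = missing_characters(load_alphabet(ALPHABET_LINES), TRANSCRIPTS)
print(missing)
```

Any character reported as missing is one the model can emit but the alphabet file does not list, which is the situation described above for the Spanish model.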

To check, TFModelState::init and TFLiteModelState::init can be modified to print the alphabet that is loaded with the model (the one used to train it), here: https://github.com/coqui-ai/STT/blob/653ce25a7ce5bd6cbb564416d847d8afcd5c5e8c/native_client/tfmodelstate.cc#L120

JRMeyer, Jun 24 '21 08:06