TensorFlowASR
TensorFlowASR copied to clipboard
Extracting phonems from audio
Can this be modified to extract Phonems with their start and end time stamps in audio file?
I believe we can, since all we need to do is making a custom version of the ASRDataset, Speech Featurizer, Text Featurizer and feed the dataset into models with phonemes classes.