IPED
Option to use a language model with Wav2Vec2 transcription
This was left as a future improvement in #1214. The following issue should be investigated: https://github.com/jonatasgrosman/huggingsound/issues/62
I still haven't been able to get the (suggested) KenshoLMDecoder implementation from the huggingsound library working properly with a language model, so I couldn't evaluate it. However, I did manage to use the ParlanceLMDecoder implementation together with Jonatas Grosman's fine-tuned wav2vec2 large Portuguese model, and the results were not good: