openspeech icon indicating copy to clipboard operation
openspeech copied to clipboard

Best decoder?

Open OleguerCanal opened this issue 3 years ago • 1 comments

❓ Questions & Help

Hi guys! First of all thank you so much for such an amazing repo :)

I'd like to know if you have some insights on which decoder architecture works best for end2end training for medium-hard audio. Imagine that model size and available of data are not a problem. Have you done some tests or know of some paper comparing them?

OleguerCanal avatar Feb 24 '22 12:02 OleguerCanal

Hi @OleguerCanal, Did you figure out which encoder - decoder pair was the most successful regarding your experiments ?

I am training Contextnet & Conformer encoders together with transducer decoders (not converging at all) and LSTM decoder: lstms are converging but the output predictions seems not perfectly aligned though it outputs correct words (some words are keep being repeated)

I am training on Librispeech for now.

Best

virgile-blg avatar Apr 06 '22 08:04 virgile-blg