
Problem with Wav2Vec finetune task

Open ayw001 opened this issue 3 years ago • 0 comments

Last year, using fairseq, I successfully fine-tuned both the base and large models on my own data, and the results were very good. However, I ran into a problem when I tried to repeat the same fine-tuning task. At the beginning, training seems normal: eval_loss declines as expected, but eval_wer stays at 1.0. Eventually, valid_loss goes to 'nan' and the output model does not produce anything. One thing I observed in the log file is "ctc_loss_reduction": "sum". According to your blog, that should be "mean".

I do not configure that at all.

Is this a problem? If it is, how do I fix it?
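To make the difference between the two settings concrete, here is a minimal pure-Python sketch of what "sum" and "mean" do to a batch of CTC losses. It assumes the reduction semantics documented for PyTorch's CTC loss ("mean" divides each utterance's loss by its target length and then averages over the batch; "sum" adds the raw losses). The function name `reduce_ctc_losses` and all numbers are hypothetical and only for illustration; they are not taken from the log.

```python
def reduce_ctc_losses(losses, target_lengths, reduction):
    """Combine per-utterance CTC losses the way "sum" and "mean" are assumed to work."""
    if reduction == "sum":
        # Raw losses are added, so the value grows with batch size and transcript length.
        return sum(losses)
    if reduction == "mean":
        # Each loss is divided by its target length (at least 1), then averaged over the batch.
        normalized = [loss / max(length, 1) for loss, length in zip(losses, target_lengths)]
        return sum(normalized) / len(normalized)
    raise ValueError(f"unsupported reduction: {reduction!r}")


# Hypothetical per-utterance losses and transcript lengths (illustration only).
losses = [120.0, 80.0, 200.0, 100.0]
target_lengths = [40, 20, 50, 25]

print(reduce_ctc_losses(losses, target_lengths, "sum"))   # 500.0
print(reduce_ctc_losses(losses, target_lengths, "mean"))  # 3.75
```

With "sum", the loss (and therefore the gradient) is far larger for the same batch, which is one plausible route to divergence and 'nan' at an unchanged learning rate. If the training script uses Hugging Face transformers (an assumption; the library is not named above), the default for `ctc_loss_reduction` in `Wav2Vec2Config` is "sum", so it shows up in the log unless it is overridden, for example by passing `ctc_loss_reduction="mean"` when loading `Wav2Vec2ForCTC` with `from_pretrained`.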

Thanks

Yuanjun

ayw001, Sep 01 '22 14:09