blog icon indicating copy to clipboard operation
blog copied to clipboard

Unable to recreate Timit dataset results using wav2vec pretrained model

Open paulista5 opened this issue 3 years ago • 7 comments

Hi,

I am unable to recreate the results reported in your blogpost https://huggingface.co/blog/fine-tune-wav2vec2-english on the timit dataset. I just ran the notebook as it is on google-colab, but got very different results. I am attaching a screenshot of the training and validation results. The results are completely different from the ones reported in the blog post. Can u please let m know the issue here? and how we can recreate the results? Thanks Screenshot 2021-04-07 at 8 05 37 PM

paulista5 avatar Apr 07 '21 17:04 paulista5

hi i am facing the same issue as @paulista5 . I am doing the same exercise on custom data , still facing the same WER 1.0 can any you @patrickvonplaten please help out here ?

imtiaz3990 avatar Apr 14 '21 09:04 imtiaz3990

Hey @paulista5,

Many people have tried to run the notebook and it seemed to work fine - did you change any settings? Also to me it looks like your model is heavily overfitting -> maybe you should add some weight decay and reduce the learning rate

patrickvonplaten avatar Apr 21 '21 21:04 patrickvonplaten

Hi, I am also facing the same issue. The WER is fixed at 1 throughout the training.

sourabharsh avatar Jun 26 '21 18:06 sourabharsh

Also, the timit dataset seems to have identical samples in your notebook. Screen Shot 2021-06-26 at 11 45 10 PM

sourabharsh avatar Jun 26 '21 18:06 sourabharsh

I think this happened with an old version of datasets, could you try updating the datasets library?

patrickvonplaten avatar Jun 27 '21 11:06 patrickvonplaten

image

can anyone help me out with this?

ahmedlone127 avatar Jul 20 '21 11:07 ahmedlone127