FullSubNet-plus icon indicating copy to clipboard operation
FullSubNet-plus copied to clipboard

soundfile.LibsndfileError: Error opening 'xx/xx/xx.wav': File contains data in an unknown format.

Open QHCV opened this issue 2 years ago • 8 comments

I find a problem when training my model:

soundfile.LibsndfileError: Error opening 'xx/xx/xx.wav': File contains data in an unknown format.

I run this in Ubuntu

I have tried many methods. What can I do about this problem? Think you!

QHCV avatar Oct 24 '22 05:10 QHCV

ffmpeg is all you need, i meet the same question in other projects, you can try conda install ffmpeg.

Plutoisme avatar Oct 24 '22 06:10 Plutoisme

ffmpeg is all you need, i meet the same question in other projects, you can try conda install ffmpeg.

image The problem is the same.

QHCV avatar Oct 24 '22 09:10 QHCV

ffmpeg cannot solve my problem (fyi, I'm trying to asr a mp4)

MonkeyDNoppu avatar Oct 24 '22 17:10 MonkeyDNoppu

I'm very sorry, but it turned out that the audio files in the dataset were corrupted, which caused this error. The dataset I used is https://github.com/microsoft/DNS-Challenge/tree/interspeech2020/master. I switched to other data sets and there was no error, thank you for your help.

QHCV avatar Oct 25 '22 06:10 QHCV

when you used the DNS-Challenge dataset, did you use the shell file 'download-dns-challenge-4.sh ' to download the data?

Plutoisme avatar Oct 25 '22 06:10 Plutoisme

when you used the DNS-Challenge dataset, did you use the shell file 'download-dns-challenge-4.sh ' to download the data? image

No, I use the DCN 2020 dataset.

QHCV avatar Oct 25 '22 06:10 QHCV

when you used the DNS-Challenge dataset, did you use the shell file 'download-dns-challenge-4.sh ' to download the data? image

No, I use the DCN 2020 dataset.

Sometimes the similar bug is relate to the encoding format,I usually use the same function to deal with different .wav files, sometimes show bug, ffmpeg may solve a part of the problem.

Plutoisme avatar Oct 25 '22 07:10 Plutoisme

Sometimes the similar bug is relate to the encoding format,I usually use the same function to deal with different .wav files, sometimes show bug, ffmpeg may solve a part of the problem.

Thank you for sharing your experience.

QHCV avatar Oct 25 '22 07:10 QHCV