ULCA-asr-dataset-corpus icon indicating copy to clipboard operation
ULCA-asr-dataset-corpus copied to clipboard

File format issue in Malayalam dataset

Open kavyamanohar opened this issue 3 years ago • 0 comments

The Malayalam dataset in the categories: dd_malayalam, joshtalks and The_Cue can not be played or processed due to some file formatting issue.

It gives the following error when trying to process it using soxi command:

soxi FAIL formats: can't open input file `175_2166file-idj5GxKHFgrNs.wav': WAVE: RIFF header not found

kavyamanohar avatar Jan 09 '22 17:01 kavyamanohar