ULCA-asr-dataset-corpus
ULCA-asr-dataset-corpus copied to clipboard
File format issue in Malayalam dataset
The Malayalam dataset in the categories: dd_malayalam, joshtalks and The_Cue can not be played or processed due to some file formatting issue.
It gives the following error when trying to process it using soxi command:
soxi FAIL formats: can't open input file `175_2166file-idj5GxKHFgrNs.wav': WAVE: RIFF header not found