icefall icon indicating copy to clipboard operation
icefall copied to clipboard

A question about the data preparation on AMI corpus

Open hiranoyu0830 opened this issue 10 months ago • 0 comments

Hello, thank you for making such a great toolkit.

I am reproducing pruned_transducer_stateless7 on AMI corpus. When I applied GSS enhancer to training data (prepare.sh: stage 3, local/prepare_ami_gss.sh: stage 4), I encountered an assertion error below.

Original Traceback (most recent call last): File "/opt/conda/lib/python3.10/site-packages/torch/utils/data/_utils/worker.py", line 308, in _worker_loop data = fetcher.fetch(index) File "/opt/conda/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 53, in fetch data = self.dataset[possibly_batched_index] File "/opt/conda/lib/python3.10/site-packages/gss/utils/data_utils.py", line 49, in getitem self._validate(cuts) File "/opt/conda/lib/python3.10/site-packages/gss/utils/data_utils.py", line 112, in _validate validate(cuts) File "/opt/conda/lib/python3.10/site-packages/lhotse/qa.py", line 39, in validate validator(obj, read_data=read_data) File "/opt/conda/lib/python3.10/site-packages/lhotse/qa.py", line 510, in validate_cut_set validate_cut(c, read_data=read_data) File "/opt/conda/lib/python3.10/site-packages/lhotse/qa.py", line 384, in validate_cut assert is_equal_or_contains(c.recording.channel_ids, c.channel) AssertionError

I found that the audio begin with IS* IN*, and IB* have only 4-channel but not 8-channel in Array2 (ex. https://groups.inf.ed.ac.uk/ami//AMICorpusMirror/amicorpus/IS1000a/audio/). However, egs/ami/ASR/local/prepare_ami_gss.sh stage4 looks to assume that training data has 8-channel audio. I guess the assertion error was caused by this.

I would like to ask you when you implement egs/ami/ASR/pruned_transducer_stateless7, were there 8-channel audio of Array2 in IS* IN*, and IB*?

Thank you.

hiranoyu0830 avatar Apr 03 '24 09:04 hiranoyu0830