icefall
icefall copied to clipboard
A question about the data preparation on AMI corpus
Hello, thank you for making such a great toolkit.
I am reproducing pruned_transducer_stateless7 on AMI corpus. When I applied GSS enhancer to training data (prepare.sh: stage 3, local/prepare_ami_gss.sh: stage 4), I encountered an assertion error below.
Original Traceback (most recent call last): File "/opt/conda/lib/python3.10/site-packages/torch/utils/data/_utils/worker.py", line 308, in _worker_loop data = fetcher.fetch(index) File "/opt/conda/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 53, in fetch data = self.dataset[possibly_batched_index] File "/opt/conda/lib/python3.10/site-packages/gss/utils/data_utils.py", line 49, in getitem self._validate(cuts) File "/opt/conda/lib/python3.10/site-packages/gss/utils/data_utils.py", line 112, in _validate validate(cuts) File "/opt/conda/lib/python3.10/site-packages/lhotse/qa.py", line 39, in validate validator(obj, read_data=read_data) File "/opt/conda/lib/python3.10/site-packages/lhotse/qa.py", line 510, in validate_cut_set validate_cut(c, read_data=read_data) File "/opt/conda/lib/python3.10/site-packages/lhotse/qa.py", line 384, in validate_cut assert is_equal_or_contains(c.recording.channel_ids, c.channel) AssertionError
I found that the audio begin with IS*
IN*
, and IB*
have only 4-channel but not 8-channel in Array2 (ex. https://groups.inf.ed.ac.uk/ami//AMICorpusMirror/amicorpus/IS1000a/audio/).
However, egs/ami/ASR/local/prepare_ami_gss.sh
stage4 looks to assume that training data has 8-channel audio. I guess the assertion error was caused by this.
I would like to ask you when you implement egs/ami/ASR/pruned_transducer_stateless7, were there 8-channel audio of Array2 in IS*
IN*
, and IB*
?
Thank you.