
Check the number of expected utterances in data preparation

Open: csukuangfj opened this issue 2 years ago • 1 comment

I suggest adding checks like the one below to dataset preparation:

assert len(manifests[part]['recordings']) == expected_number_of_recordings


# we can also check expected durations

With such checks, we can catch problems like the one in https://github.com/lhotse-speech/lhotse/issues/761 during data preparation.
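A minimal sketch of how both checks (count and total duration) might look. The helper name `check_recordings`, its parameters, and the tolerance are hypothetical and not part of lhotse. It only assumes that the recordings collection is iterable and that each item exposes a `duration` attribute in seconds.

```python
import math


def check_recordings(recordings, expected_count, expected_hours=None, rel_tol=0.01):
    """Validate a prepared recordings collection (hypothetical helper).

    recordings: any iterable of objects with a `duration` attribute (seconds).
    expected_count: number of recordings the dataset part should contain.
    expected_hours: optional expected total duration in hours.
    rel_tol: relative tolerance for the duration check (assumed value).
    """
    items = list(recordings)

    # Check 1: the number of recordings matches the expected number.
    if len(items) != expected_count:
        raise ValueError(
            f"Expected {expected_count} recordings, found {len(items)}"
        )

    # Check 2 (optional): the total duration is close to the expected value.
    if expected_hours is not None:
        total_hours = sum(r.duration for r in items) / 3600
        if not math.isclose(total_hours, expected_hours, rel_tol=rel_tol):
            raise ValueError(
                f"Expected about {expected_hours} h of audio, "
                f"found {total_hours:.2f} h"
            )
```

It could be called as `check_recordings(manifests[part]['recordings'], expected_number_of_recordings)`, with the expected values supplied per dataset part by each recipe. The sketch raises `ValueError` instead of using `assert` so the check still runs when Python is started with `-O`; a plain `assert` as in the snippet above would also work.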

csukuangfj, Jul 04 '22 01:07

Good idea!

pzelasko, Jul 04 '22 01:07