lhotse
lhotse copied to clipboard
Check the number of expected utterances in data preparation
I suggest adding some checks like below to dataset preparation
assert len(manifests[part]['recordings']) == expected_number_of_recordings
# we can also check expected durations
With such checks, we can avoid issues in https://github.com/lhotse-speech/lhotse/issues/761
Good idea!