crnn-lid
Changing to shorter segments
What do I need to change to train on shorter segments? My data has a minimum length of 3 seconds, but the wav_to_spec module still only processes speech that is 10 seconds or longer.
In case you still want an answer:
- Open config.yaml.
- Find the input_shape parameter, for example [129, 500, 1].
- Set segment_length to 3. This is the minimum audio length in seconds.
- Set pixel_per_second = 500 / segment_length. In general, input_shape[1] must equal segment_length * pixel_per_second; you can satisfy that by changing either value, or the input shape itself.
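
To make the arithmetic concrete, here is a small sketch in plain Python. It does not use or call anything from crnn-lid; the helper names are made up for illustration, and the rate of 50 pixels per second is only my guess based on a width of 500 for 10-second segments. Note that 500 / 3 is not a whole number, so if the spectrogram code needs an integer rate, it may be easier to keep the rate and shrink input_shape[1] instead (assuming the model is fine with the narrower input).

```python
from fractions import Fraction


def rate_for_fixed_width(width, segment_length):
    """pixel_per_second needed to keep input_shape[1] == width (hypothetical helper)."""
    return Fraction(width, segment_length)


def width_for_fixed_rate(segment_length, pixel_per_second):
    """input_shape[1] needed for a given segment length and rate (hypothetical helper)."""
    return segment_length * pixel_per_second


def is_consistent(input_shape, segment_length, pixel_per_second):
    """Check the rule: input_shape[1] == segment_length * pixel_per_second."""
    return input_shape[1] == segment_length * pixel_per_second


# Option A: keep the width at 500 and change the rate.
rate = rate_for_fixed_width(500, 3)
print(float(rate))  # about 166.67, not a whole number

# Option B: keep an assumed rate of 50 px/s (500 px / 10 s) and change the width.
width = width_for_fixed_rate(3, 50)
print(width)  # 150

print(is_consistent([129, width, 1], 3, 50))  # True
print(is_consistent([129, 500, 1], 3, 50))    # False: width still matches 10 s
```

With option B the config would end up with input_shape [129, 150, 1], segment_length 3 and pixel_per_second 50.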