Changing to shorter segments

Open • Themba4Sho opened this issue 3 years ago • 1 comment

What do I need to change to train on shorter segments? I used data with a minimum length of 3 seconds, but the wav_to_spec module still only processes speech that is 10 seconds or longer.

Themba4Sho • Jun 28 '22 10:06

In case you still want an answer:

  1. Open config.yaml.
  2. Find the input_shape parameter, for example [129, 500, 1].
  3. Set segment_length to 3. This is the minimum audio length in seconds.
  4. Set pixel_per_second = 500 / segment_length. In general, you must ensure that input_shape[1] = segment_length * pixel_per_second, by whatever means you wish.
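
The arithmetic in step 4 can be sketched in Python as below. This is only an illustration of the relation input_shape[1] = segment_length * pixel_per_second. The helper names `pixels_per_second` and `is_consistent` are hypothetical and not part of crnn-lid, and the dict stands in for the values you would put in config.yaml.

```python
import math


def pixels_per_second(input_shape, segment_length):
    """Return the pixel_per_second value that makes the spectrogram width
    equal input_shape[1] for the given segment_length (in seconds)."""
    if segment_length <= 0:
        raise ValueError("segment_length must be positive")
    return input_shape[1] / segment_length


def is_consistent(input_shape, segment_length, pixel_per_second):
    """Check that input_shape[1] == segment_length * pixel_per_second."""
    return math.isclose(segment_length * pixel_per_second, input_shape[1])


# Hypothetical stand-in for the relevant config.yaml entries.
config = {"input_shape": [129, 500, 1], "segment_length": 3}
config["pixel_per_second"] = pixels_per_second(
    config["input_shape"], config["segment_length"]
)

print(config["pixel_per_second"])
print(is_consistent(config["input_shape"],
                    config["segment_length"],
                    config["pixel_per_second"]))
```

Note that 500 / 3 is not a whole number. If a fractional pixel_per_second causes problems (this is an untested assumption), another way to satisfy the same relation is to change the width instead, for example input_shape = [129, 150, 1] with pixel_per_second = 50.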

Vadim2S • Jan 16 '23 07:01