librispeech-alignments icon indicating copy to clipboard operation
librispeech-alignments copied to clipboard

How was the aligner configured?

Open stephenmelsom opened this issue 5 years ago • 7 comments

I wanted to try and replicate these alignments, but it looks like the timestamps were different than yours. Were you using the default configuration or did you make any changes?

Thanks!

stephenmelsom avatar Feb 07 '20 21:02 stephenmelsom

Which alignments did you check? I used default parameters for the textgrid ones, but I applied a cleaning script to get the txt ones. I don't have that script anymore unfortunately.

CorentinJ avatar Feb 08 '20 20:02 CorentinJ

I checked both just to be sure. I noticed the discrepancy after trying to run my own cleaned alignments through synthesizer preprocessing script in your sv2tts repo. There's an assertion in the split_on_silences function that checks if the first and last words are silences and that's where I errored out.

I'm wondering if there were updates to the MFA that may have caused this.

stephenmelsom avatar Feb 10 '20 14:02 stephenmelsom

Ah right, sorry I didn't remember that until you mentioned it. Yes, I normalized everything in such a way that a sentences ends and starts with a silence, even if it's a 0-duration one. It was just out of convenience, I can't really remember why.

Silences are represented as empty words, e.g. in the first sentence there is a silence from 0s to 0.49s and the word 'GO' is pronounced from 0.49s to 0.89s. Each sentence is guaranteed to start and end with a silence, even if its duration is 0, this is for parsing convenience.

CorentinJ avatar Feb 10 '20 14:02 CorentinJ

Do you happen to recall (generally) how you normalized those sentences to determine those silences?

stephenmelsom avatar Feb 10 '20 17:02 stephenmelsom

Hi @sjmelsom, I want to create alignments on VCTK dataset, and I have no idea how to use the MFA. Is it possible to share your work regarding the creation of alignments?

stray128 avatar Mar 03 '20 12:03 stray128

@stray128 I would recommend that you head over to the MFA docs. They provide a reasonable amount of material that will help you get started.

stephenmelsom avatar Mar 03 '20 13:03 stephenmelsom

Thanks, @sjmelsom. Also, I have another question for you. did you train the synthesizer model on the alignments you generated? If so, Were the results good when you surpass the number of steps that @CorentinJ trained?

stray128 avatar Mar 04 '20 12:03 stray128