Rishikesh (ऋषिकेश)

182 comments by Rishikesh (ऋषिकेश)

I am also planning to implement the same.

Are you treating phoneme duration as a classification task? Phoneme duration is a discrete value, not a continuous one, and durations more or less range from 0 to 50 at most.
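To make the classification framing concrete, here is a minimal sketch in plain Python. It assumes durations are integer frame counts capped at 50 (the rough range mentioned above), so each duration becomes one of 51 classes. The names `MAX_DURATION`, `duration_to_class`, `cross_entropy`, and `predict_duration` are hypothetical and only for illustration; they are not taken from any particular codebase.

```python
import math

MAX_DURATION = 50  # assumed cap, based on the "0 to 50" range mentioned above
NUM_CLASSES = MAX_DURATION + 1  # one class per integer frame count 0..50


def duration_to_class(frames: int) -> int:
    """Clamp an integer frame count into the class range [0, MAX_DURATION]."""
    return max(0, min(MAX_DURATION, frames))


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, target_class: int) -> float:
    """Negative log-likelihood of the target duration class."""
    probs = softmax(logits)
    return -math.log(probs[target_class])


def predict_duration(logits) -> int:
    """Greedy decode: the most likely class index is the frame count."""
    return max(range(len(logits)), key=lambda i: logits[i])


# Example: a predictor that strongly favours 7 frames for some phoneme.
logits = [0.0] * NUM_CLASSES
logits[7] = 5.0
loss = cross_entropy(logits, duration_to_class(7))
frames = predict_duration(logits)
```

Compared with regressing a continuous value, this keeps the output on the same integer grid the aligner produces, at the cost of ignoring that neighbouring classes (7 vs. 8 frames) are close to each other.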

If you use a standard token and predict the token, you treat it as a classification task, which I also support. It should be fast because I don't think it is...

Yes, when you predict duration using a duration predictor, the speech almost always comes out fast; only in some cases does it come out at a normal pace. One way to tackle this problem is to...

> I might need to avoid two sequences in parallel and instead switch between duration and phoneme prediction to make duration dependent on phoneme...

Yes.

@ex3ndr Samples sound decent 👍🏽

Some initial feedback:

* Issue with special characters such as `-`: for example, it takes a long pause between "open" and "source" when pronouncing `open-source`.
* Issue when pronouncing abbreviated words like...

I trained MFA on eSpeak phonemes. If you want, I can share the English MFA model trained on eSpeak IPA.

@ex3ndr Have you tried Gaussian upsampling for the length regulator, using the sampled durations?
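For reference, this is roughly what I mean by Gaussian upsampling, as a plain-Python sketch rather than a definitive implementation. It assumes each phoneme has a hidden vector, a duration in frames, and a positive standard deviation (in practice predicted by the model; here passed in by hand). Each output frame is a weighted mix of all phoneme vectors, with weights given by a Gaussian centred at the middle of each phoneme's span. The function name `gaussian_upsample` and its arguments are hypothetical.

```python
import math


def gaussian_upsample(hidden, durations, sigmas):
    """Expand per-phoneme vectors to per-frame vectors with soft Gaussian weights.

    hidden:    list of per-phoneme vectors (lists of floats, equal length)
    durations: per-phoneme durations in frames
    sigmas:    per-phoneme standard deviations, each must be > 0
    """
    total_frames = int(round(sum(durations)))

    # Centre of each phoneme's span: cumulative end minus half its duration.
    centers = []
    end = 0.0
    for d in durations:
        end += d
        centers.append(end - d / 2.0)

    dim = len(hidden[0])
    frames = []
    for t in range(total_frames):
        pos = t + 0.5  # evaluate at the middle of frame t
        # Log of the Gaussian density, dropping the constant shared by all phonemes.
        scores = [-0.5 * ((pos - c) / s) ** 2 - math.log(s)
                  for c, s in zip(centers, sigmas)]
        peak = max(scores)
        weights = [math.exp(x - peak) for x in scores]
        norm = sum(weights)
        frames.append([
            sum(w * h[k] for w, h in zip(weights, hidden)) / norm
            for k in range(dim)
        ])
    return frames


# Two phonemes with one-hot vectors, lasting 2 and 3 frames.
hidden = [[1.0, 0.0], [0.0, 1.0]]
out = gaussian_upsample(hidden, durations=[2, 3], sigmas=[0.5, 0.5])
```

Unlike a hard length regulator that simply repeats each phoneme vector, the weights here vary smoothly across phoneme boundaries, and the operation is differentiable with respect to the durations and standard deviations.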

Hi @ex3ndr, have you tested this model with only the auto-regressive duration predictor? That is, I simply give text as input and predict phonemes along with duration, but not pitch. As per logic,...