RHVoice icon indicating copy to clipboard operation
RHVoice copied to clipboard

Support For The English RMS Voice

Open btman16 opened this issue 3 years ago • 13 comments

Hello,

I'm using the RH Voice add-on with the NVDA screen reader and think this synthesizer definitely has a lot of potential.

It's very fast and responsive and it's nice to see that this is being actively worked on.

The English support offers the ability to use the Bdl voice which is very good. There is another voice which is part of that same set called RMS which is also another very good English voice and I was wondering if it's possible to add this into the available English voices for RH Voice by chance please?

I generally prefer this one because I feel it has slightly clearer pronunciations and so having this as another option would be a nice edition.

I want to thank everyone involved for their work on this synthesizer as it's continually improving.

Thank you very much for your time and I look forward to any input you may have concerning this.

Sincerely,

Brandon Tyson

btman16 avatar Feb 22 '22 08:02 btman16

Hello.

I think we at RHVoice lab could try to train this voice, but the problem is that i wasn't able to find high quality dataset for that voice, only 16-khz

beqabeqa473 avatar Feb 22 '22 13:02 beqabeqa473

Hello,

I think that it wouldn't sound bad if you were to try and train it with the data even though it's only at 16 kHZ. The reason I think this would be ok is that another synthesizer, MaryTTS, also uses this voice, and at least when I hear that, it sounds like a very similar synthesis method to what RH Voice is doing.

Thanks again for the quick response and I look forward to any input you may have.

Sincerely,

Brandon Tyson

On 2/22/22, Beqa Gozalishvili @.***> wrote:

Hello.

I think we at RHVoice lab could try to train this voice, but the problem is that i wasn't able to find high quality dataset for that voice, only 16-khz

-- Reply to this email directly or view it on GitHub: https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1047784354 You are receiving this because you authored the thread.

Message ID: @.***>

-- “Be what you are. This is the first step towards becoming better than you are.” – J. C. Hare & A. W. Hare

btman16 avatar Feb 22 '22 14:02 btman16

Hi Beqa,

Let me try to find it and train it.

At least, there should be 32 k original version.

zstanecic avatar Feb 22 '22 16:02 zstanecic

As i said, there is no other versions on festival website.

On 2/22/22, Zvonimir Stanečić @.***> wrote:

Hi Beqa,

Let me try to find it and train it.

At least, there should be 32 k original version.

-- Reply to this email directly or view it on GitHub: https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1048008900 You are receiving this because you commented.

Message ID: @.***>

-- with best regards Beqa Gozalishvili Tell: +995593454005 Email: @.*** Web: https://gozaltech.org Skype: beqabeqa473 Telegram: https://t.me/gozaltech facebook: https://facebook.com/gozaltech twitter: https://twitter.com/beqabeqa473 Instagram: https://instagram.com/beqa.gozalishvili

beqabeqa473 avatar Feb 22 '22 19:02 beqabeqa473

One small hint:

During the training, you cannot apply world. During resynthesis, you will get only distorted buzzing.

Only old vocoder works there.

From: Beqa Gozalishvili @.> Sent: Tuesday, February 22, 2022 8:22 PM To: RHVoice/RHVoice @.> Cc: Zvonimir Stanečić @.>; Comment @.> Subject: Re: [RHVoice/RHVoice] Support For The English RMS Voice (Issue #482)

As i said, there is no other versions on festival website.

On 2/22/22, Zvonimir Stanečić @.*** mailto:***@***.*** > wrote:

Hi Beqa,

Let me try to find it and train it.

At least, there should be 32 k original version.

-- Reply to this email directly or view it on GitHub: https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1048008900 You are receiving this because you commented.

Message ID: @.*** mailto:***@***.*** >

-- with best regards Beqa Gozalishvili Tell: +995593454005 Email: @.*** mailto:***@***.*** Web: https://gozaltech.org Skype: beqabeqa473 Telegram: https://t.me/gozaltech facebook: https://facebook.com/gozaltech twitter: https://twitter.com/beqabeqa473 Instagram: https://instagram.com/beqa.gozalishvili

— Reply to this email directly, view it on GitHub https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1048132774 , or unsubscribe https://github.com/notifications/unsubscribe-auth/ACVCDE73GQE5APSZCFDPASDU4PO63ANCNFSM5PAUGT7Q . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub . You are receiving this because you commented. https://github.com/notifications/beacon/ACVCDE6EDHYQJCI3AXIXZY3U4PO63A5CNFSM5PAUGT72YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOHZ4TZJQ.gif Message ID: @.*** @.***> >

zstanecic avatar Feb 22 '22 19:02 zstanecic

Isn't the training data for some of the other voices such as BDL 16 K?

datajake1999 avatar Feb 24 '22 17:02 datajake1999

Yes, bdl was trained using 16 K. I am preparing a fix for this issue. I am experimenting with some things.

zstanecic avatar Mar 20 '22 14:03 zstanecic

Any updates on this issue? I quite like this voice, and would love to see it in RHVoice

trypsynth avatar Jul 29 '22 20:07 trypsynth

Update of this issue: due to lack of audio data, for 48 k training with new methods and approaches, please close this issue. According to my training, 16 k is worse, compared to new voices we recently train.

zstanecic avatar Aug 08 '22 08:08 zstanecic

@zstanecic, would it be lower quality than the Alan voice? If not, I see no reason to close it. I heard RMS in Festival, and it honestly sounded fine.

trypsynth avatar Aug 08 '22 13:08 trypsynth

RMS is the lower quality than Alan. It cannot be even compared.

zstanecic avatar Aug 08 '22 14:08 zstanecic

Hi @zstanecic, I have attached an audio recording comparing the two voices. When hearing Alan, it definitely sounds downsampled, whereas RMS does not (it sounds similar to SLT and BDL) at least to my ear.

https://www.dropbox.com/s/vrfu8bt5xdyoi5w/RMSDemo2022-08-08_09-11-09.flac?dl=1

trypsynth avatar Aug 08 '22 15:08 trypsynth

@TheQuinbox those were the hts voices. This is slitely off topic but I need the latest festival addon for nvda 2021.3.5 and the way of using both clustergen and hts voices. Thanks Also, rhvooice's version of alan has a lower quality than that of the one I've hird, or at least I think so

king-dahmanus avatar Aug 09 '22 13:08 king-dahmanus

This voice cannot be trained using pyworld. @alex19EP please close this issue as not planned.

zstanecic avatar Aug 22 '22 15:08 zstanecic