RHVoice
RHVoice copied to clipboard
Support For The English RMS Voice
Hello,
I'm using the RH Voice add-on with the NVDA screen reader and think this synthesizer definitely has a lot of potential.
It's very fast and responsive and it's nice to see that this is being actively worked on.
The English support offers the ability to use the Bdl voice which is very good. There is another voice which is part of that same set called RMS which is also another very good English voice and I was wondering if it's possible to add this into the available English voices for RH Voice by chance please?
I generally prefer this one because I feel it has slightly clearer pronunciations and so having this as another option would be a nice edition.
I want to thank everyone involved for their work on this synthesizer as it's continually improving.
Thank you very much for your time and I look forward to any input you may have concerning this.
Sincerely,
Brandon Tyson
Hello.
I think we at RHVoice lab could try to train this voice, but the problem is that i wasn't able to find high quality dataset for that voice, only 16-khz
Hello,
I think that it wouldn't sound bad if you were to try and train it with the data even though it's only at 16 kHZ. The reason I think this would be ok is that another synthesizer, MaryTTS, also uses this voice, and at least when I hear that, it sounds like a very similar synthesis method to what RH Voice is doing.
Thanks again for the quick response and I look forward to any input you may have.
Sincerely,
Brandon Tyson
On 2/22/22, Beqa Gozalishvili @.***> wrote:
Hello.
I think we at RHVoice lab could try to train this voice, but the problem is that i wasn't able to find high quality dataset for that voice, only 16-khz
-- Reply to this email directly or view it on GitHub: https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1047784354 You are receiving this because you authored the thread.
Message ID: @.***>
-- “Be what you are. This is the first step towards becoming better than you are.” – J. C. Hare & A. W. Hare
Hi Beqa,
Let me try to find it and train it.
At least, there should be 32 k original version.
As i said, there is no other versions on festival website.
On 2/22/22, Zvonimir Stanečić @.***> wrote:
Hi Beqa,
Let me try to find it and train it.
At least, there should be 32 k original version.
-- Reply to this email directly or view it on GitHub: https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1048008900 You are receiving this because you commented.
Message ID: @.***>
-- with best regards Beqa Gozalishvili Tell: +995593454005 Email: @.*** Web: https://gozaltech.org Skype: beqabeqa473 Telegram: https://t.me/gozaltech facebook: https://facebook.com/gozaltech twitter: https://twitter.com/beqabeqa473 Instagram: https://instagram.com/beqa.gozalishvili
One small hint:
During the training, you cannot apply world. During resynthesis, you will get only distorted buzzing.
Only old vocoder works there.
From: Beqa Gozalishvili @.> Sent: Tuesday, February 22, 2022 8:22 PM To: RHVoice/RHVoice @.> Cc: Zvonimir Stanečić @.>; Comment @.> Subject: Re: [RHVoice/RHVoice] Support For The English RMS Voice (Issue #482)
As i said, there is no other versions on festival website.
On 2/22/22, Zvonimir Stanečić @.*** mailto:***@***.*** > wrote:
Hi Beqa,
Let me try to find it and train it.
At least, there should be 32 k original version.
-- Reply to this email directly or view it on GitHub: https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1048008900 You are receiving this because you commented.
Message ID: @.*** mailto:***@***.*** >
-- with best regards Beqa Gozalishvili Tell: +995593454005 Email: @.*** mailto:***@***.*** Web: https://gozaltech.org Skype: beqabeqa473 Telegram: https://t.me/gozaltech facebook: https://facebook.com/gozaltech twitter: https://twitter.com/beqabeqa473 Instagram: https://instagram.com/beqa.gozalishvili
— Reply to this email directly, view it on GitHub https://github.com/RHVoice/RHVoice/issues/482#issuecomment-1048132774 , or unsubscribe https://github.com/notifications/unsubscribe-auth/ACVCDE73GQE5APSZCFDPASDU4PO63ANCNFSM5PAUGT7Q . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub . You are receiving this because you commented. https://github.com/notifications/beacon/ACVCDE6EDHYQJCI3AXIXZY3U4PO63A5CNFSM5PAUGT72YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOHZ4TZJQ.gif Message ID: @.*** @.***> >
Isn't the training data for some of the other voices such as BDL 16 K?
Yes, bdl was trained using 16 K. I am preparing a fix for this issue. I am experimenting with some things.
Any updates on this issue? I quite like this voice, and would love to see it in RHVoice
Update of this issue: due to lack of audio data, for 48 k training with new methods and approaches, please close this issue. According to my training, 16 k is worse, compared to new voices we recently train.
@zstanecic, would it be lower quality than the Alan voice? If not, I see no reason to close it. I heard RMS in Festival, and it honestly sounded fine.
RMS is the lower quality than Alan. It cannot be even compared.
Hi @zstanecic, I have attached an audio recording comparing the two voices. When hearing Alan, it definitely sounds downsampled, whereas RMS does not (it sounds similar to SLT and BDL) at least to my ear.
https://www.dropbox.com/s/vrfu8bt5xdyoi5w/RMSDemo2022-08-08_09-11-09.flac?dl=1
@TheQuinbox those were the hts voices. This is slitely off topic but I need the latest festival addon for nvda 2021.3.5 and the way of using both clustergen and hts voices. Thanks Also, rhvooice's version of alan has a lower quality than that of the one I've hird, or at least I think so
This voice cannot be trained using pyworld. @alex19EP please close this issue as not planned.