Retrieval-based-Voice-Conversion-WebUI icon indicating copy to clipboard operation
Retrieval-based-Voice-Conversion-WebUI copied to clipboard

Incorrect Pronunciation of single word 'He' after Conversion

Open canadDN opened this issue 1 year ago • 0 comments

Hello,

I encountered an issue with the RVC (1006NVIDIA) when converting a TTS-generated voice file. Specifically, the single word "He" is not pronounced correctly after conversion. Instead of the expected pronunciation, the output sounds more like "swee" or "sui."

Steps to Reproduce:

  1. I used a TTS engine to generate a voice file that includes the single word "He."
  2. I applied RVC to convert the voice file.
  3. The output consistently mispronounces "He" across different RVC models I tried.

What I've Tried:

  • I tested the conversion with different TTS-generated voice files, but the issue persists.
  • I used multiple RVC models to see if the problem was model-specific, but the result was the same.
  • I adjusted the settings in RVC, but the issue remains unresolved.

Expected Result:

The RVC conversion should accurately pronounce "He" as it is in the original file.

Actual Result:

The word "He" is incorrectly converted to something like "swee" or "sui."

I have attached the original he.wav file for reference. Any help or suggestions to resolve this issue would be greatly appreciated. he.zip

Thank you!

canadDN avatar Aug 31 '24 14:08 canadDN