raccoonML
Can you provide an example? It may be the quality of your recordings or your microphone setup, such as the distance from the speaker or the recording environment.
Which version of Python are you using? It should be at least 3.6 or 3.7.
@Archviz360 I spent a few hours getting the Swedish model to work and collected all required files at this link. The setup is a little tricky because it uses tensorflow...
The Swedish pretrained synthesizer model is not my work, but it is trained from scratch and uses the English speaker encoder, as ViktorAlm explains in #257. I made these...
Here is an idea for how to deal with numbers in other languages. Code not available at this time. https://github.com/sveneschlbeck/Multi-Language-RTVC/issues/13#issuecomment-1001240560
The output varies because dropout is used in inference, in the encoder and decoder prenets. Dropout causes some tensor elements to be zeroed out at random. Its purpose is to...
It's a command-line argument for demo_cli.py and demo_toolbox.py. You also need to specify the value of the seed. For example:
```
python demo_cli.py --seed 0
python demo_toolbox.py --seed 0...
```
Notice how the synthesized output is now identical in each case, with a length of 172800 timesteps. Some variation in inference speed is normal, and does not affect output.
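To illustrate why a fixed seed makes the output repeatable, here is a minimal sketch using only the Python standard library. The names `dropout` and `toy_prenet` are hypothetical stand-ins, not code from the repository; the real models apply dropout inside the synthesizer's prenets, and this toy only mimics the random zeroing.

```python
import random

def dropout(values, p, rng):
    """Zero each element with probability p; scale survivors by 1/(1-p)."""
    keep = 1.0 - p
    return [v / keep if rng.random() < keep else 0.0 for v in values]

def toy_prenet(inputs, seed=None):
    # Hypothetical stand-in for a prenet layer where dropout stays on at inference.
    # seed=None lets Python pick its own seed, so repeated runs can differ.
    rng = random.Random(seed)
    return dropout(inputs, p=0.5, rng=rng)

inputs = [0.1 * i for i in range(1, 9)]

# Same seed, same dropout mask, so the outputs match.
seeded_a = toy_prenet(inputs, seed=0)
seeded_b = toy_prenet(inputs, seed=0)
print(seeded_a == seeded_b)  # True

# Without a seed the mask is drawn fresh each call, so outputs usually differ.
unseeded = toy_prenet(inputs)
print(unseeded)
```

With the seed fixed, both calls zero out the same elements, which is the same reason two seeded runs of the demo scripts produce identical audio.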
Start here: https://github.com/CorentinJ/Real-Time-Voice-Cloning/issues/431#issuecomment-673555684 Here are the code changes for Spanish; you can use these to get an idea of what needs to be changed: https://github.com/raccoonML/Real-Time-Voice-Cloning/commit/e7aa707c976db0cebbdf2cc68f140edcdd31e7f4 If your text transcripts contain abbreviations...
Please try [my tutorial](https://github.com/raccoonML/Real-Time-Voice-Cloning/releases/tag/rtvc_upstream_2-3-22). Open an issue in my fork if anything is not working.