Real-Time-Voice-Cloning
Real-Time-Voice-Cloning copied to clipboard
swedish models
since blue fish is gone and the page where the installation instruction with Swedish models are gone and even all dropbox is gone I wonder if there are some others here that might have it and could upload it again?? since I'm don't understand how to train or even have the right pc specs and time I would appreciate if someone with all the files including Swedish model could upload or contact me personally and send the files.
@Archviz360 I spent a few hours getting the Swedish model to work and collected all required files at this link. The setup is a little tricky because it uses tensorflow 1.15. Enjoy! https://github.com/raccoonML/Real-Time-Voice-Cloning/releases/tag/Swedish-1
@raccoonML I was wondering if you changed the symbols.py for swedish! Because I have to change che _characters for italian but it gives me error:
size mismatch for encoder.embedding.weight: copying a param with shape torch.Size([66, 512]) from checkpoint, the shape in current model is torch.Size([74, 512])
Wich models did you start off? did you train synthesizer from scratch?
The Swedish pretrained synthesizer model is not my work, but it is trained from scratch and uses the English speaker encoder as ViktorAlm explains in #257 . I made these updates to symbols.py: https://github.com/raccoonML/Real-Time-Voice-Cloning/commit/b358eb5d8f64a00cfee32a0e46ce4122355f7d41
Updating the symbol list will require training a new synthesizer model from scratch.
@raccoonML Hi! Yes I started retraining the synthétiser, I was wondering how you felt with NUMBERS because I know there are libraries that convert digits to English numbers but not other languages! How did you deal? Thank you
Here is an idea for how to deal with numbers in other languages. Code not available at this time. https://github.com/sveneschlbeck/Multi-Language-RTVC/issues/13#issuecomment-1001240560
@raccoonML Thank you! yes one idea was to then translate! :)
@raccoonML Sorry for bothering, When retraining the synthetizer I also need to re do the preprocessing of the dataset with the new symbols right? Because I retrained on and older pre processed dataset and it is not pronouncing the new letters :(
PS C:\Users\dacia\OneDrive\Skrivbord\RTVC-Swedish> python demo_toolbox.py
2022-08-19 21:21:27.615361: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'cudart64_100.dll'; dlerror: cudart64_100.dll not found
2022-08-19 21:21:27.615492: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
Traceback (most recent call last):
File "demo_toolbox.py", line 2, in
- Downgrade the protobuf package to 3.20.x or lower.
- Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python (but this will use pure-Python parsing and will be much slower).
More information: https://developers.google.com/protocol-buffers/docs/news/2022-05-06#python-updates
how do i fix this?