dc_tts

How to run pre-trained models to generate voice e.g. Joe Rogan?

Open zarifaziz opened this issue 5 years ago • 2 comments

Hi, can you please provide instructions on how to use this repo to generate speech from input text? Sorry, I'm just having a bit of trouble figuring out how to run it.

Also is there a pre-trained model of Joe Rogan? How would I go about building this? Thanks!

zarifaziz avatar Aug 08 '19 04:08 zarifaziz

+1

NoahZu avatar Nov 19 '19 12:11 NoahZu

If I remember correctly, you extract the pretrained voice into the logdir folder and run "python synthesize.py" from the command prompt. You can edit the "harvard_sentences.txt" file to make it say whatever you want (just don't delete the numbers). A folder called "samples" is then generated in the dc_tts-master folder, containing the wav files.
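For anyone who would rather script the sentence edit than do it by hand, here is a rough sketch. It assumes the first line of "harvard_sentences.txt" is a header rather than a sentence, and that every following line looks like "1. Some text"; check your copy of the file before relying on that. The helper names are made up for illustration and are not part of the repo.

```python
from pathlib import Path


def number_sentences(sentences):
    """Prefix each sentence with '1. ', '2. ', ... (assumed file format)."""
    return [f"{i}. {s.strip()}" for i, s in enumerate(sentences, start=1)]


def rewrite_sentence_file(path, sentences):
    """Replace the numbered sentences in `path`, keeping its first line.

    Assumption: the first line is a header, not a sentence, and every
    following line has the form '<number>. <text>'.
    """
    path = Path(path)
    # Keep only the first line of the existing file (empty list if the file is empty).
    header = path.read_text(encoding="utf-8").splitlines()[:1]
    body = number_sentences(sentences)
    path.write_text("\n".join(header + body) + "\n", encoding="utf-8")
```

Calling `rewrite_sentence_file("harvard_sentences.txt", ["Hello there.", "This is a test."])` and then running "python synthesize.py" as described above should, if those assumptions hold, produce one wav file per sentence in the "samples" folder.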

As for how to make a voice, I used the LJ Speech dataset as a template to create my own dataset. I couldn't train it because I don't have enough RAM at the moment, so I'll have to wait to see if it works.
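To give an idea of what "using LJ Speech as a template" can look like: LJ Speech ships a "wavs" folder plus a "metadata.csv" in which each line is "clip id|transcription|normalized transcription", separated by pipes. The sketch below only builds such lines. The function name and the clip ids are made up for illustration, and I haven't verified which of the two text columns the training scripts in this repo actually read.

```python
def ljspeech_metadata_line(clip_id, text, normalized=None):
    """Build one pipe-separated metadata line in the LJ Speech layout.

    `clip_id` is the wav file name without the '.wav' extension.
    If no normalized text is given, the raw transcription is reused.
    """
    if normalized is None:
        normalized = text
    fields = [clip_id, text.strip(), normalized.strip()]
    # A pipe or newline inside a field would break the one-line-per-clip format.
    for field in fields:
        if "|" in field or "\n" in field:
            raise ValueError(f"field contains '|' or a newline: {field!r}")
    return "|".join(fields)


def build_metadata(clips):
    """Turn (clip_id, text) pairs into the full metadata.csv contents."""
    lines = [ljspeech_metadata_line(cid, text) for cid, text in clips]
    return "\n".join(lines) + "\n"
```

For example, `build_metadata([("myvoice-0001", "Hello there."), ("myvoice-0002", "This is a test.")])` gives the text to save as "metadata.csv" next to a "wavs" folder containing myvoice-0001.wav and myvoice-0002.wav.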

edit: If you just want to mess around with creating/cloning voices, I would recommend this model. You don't need to invest a ton of time into a single voice.

Traincraft101 avatar Nov 27 '19 20:11 Traincraft101