VoiceCraft
Usage instructions.
(Really, really impressed by the demo. It's far ahead of the best SOTA model I'd found so far, congrats on the great work.)
Running with docker/jupyter.
I followed the Docker/jupyter instructions to the letter (I'm not at all familiar with jupyter, but very familiar with docker).
It went mostly well.
I keep running cells/advancing, again and again, until I get to the bottom.
And then nothing? What's supposed to happen? I don't see any new instructions, no new files, anything, I'm fairly lost.
Running as a script.
The jupyter stuff is great for getting to know the project, but (unless I misunderstand what jupyter is) it won't really help me integrate voicecraft into my project, generate thousands of files, or "call" voicecraft programmatically from my nodejs system.
In other projects, there is something like:
python3 voicecraft/bin/inference.py --text="Read this text" --model_path="voicecraft/model/file.something" --voice_sample="/tmp/voices/robert.wav" --output="/tmp/sample_voicecraft_output.wav" --device=cpu
What's the equivalent for voicecraft, and how do I get to the point where it'll agree to run? (running inside docker is fine, or outside docker too, just need to get it to run).
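To make the use case concrete, here is a minimal sketch of the kind of batch driver I have in mind. It assumes a command shaped like the example above: the script path `voicecraft/bin/inference.py` and all of its flags are placeholders taken from that made-up command, not VoiceCraft's actual interface, and `build_command` and `generate_all` are names I invented for illustration.

```python
import subprocess
from pathlib import Path

# Hypothetical script path, copied from the example command above.
# It is a placeholder, not VoiceCraft's real command-line entry point.
SCRIPT = "voicecraft/bin/inference.py"


def build_command(text, voice_sample, output, model_path, device="cpu"):
    """Return the argument list for one (hypothetical) inference call."""
    return [
        "python3", SCRIPT,
        f"--text={text}",
        f"--model_path={model_path}",
        f"--voice_sample={voice_sample}",
        f"--output={output}",
        f"--device={device}",
    ]


def generate_all(texts, voice_sample, out_dir, model_path, run=subprocess.run):
    """Run one call per text and return the list of output paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    outputs = []
    for i, text in enumerate(texts):
        output = out_dir / f"line_{i:05d}.wav"
        cmd = build_command(text, voice_sample, str(output), model_path)
        run(cmd, check=True)  # raises CalledProcessError on a non-zero exit
        outputs.append(output)
    return outputs
```

Passing the arguments as a list rather than one shell string avoids quoting problems when the text contains spaces or quotes. The same pattern would carry over to nodejs with `child_process.spawn`.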
I found main.py, and I think the options for the command line are in config.py, but I don't know which options I need and which I don't / I don't know how to use the script. I didn't find an example of how to use it, but I'll keep looking.
Intonation.
I might be getting a bit ahead of myself here since I don't have it running yet, but maybe you know: will intonation/style transfer through? For example, if my voice sample has the person whispering, will the output be whispered too? Same for shouting, crying, etc. That's really the big thing missing from my system. Do you know if there is any way to get that to work with voicecraft?
Thanks a lot in advance! Cheers.