David Haws

Results 3 comments of David Haws

The wav clip the prompt is generated from is 12.66 seconds. I see in the FAQ that training is kept under 22 seconds. Moreover, the make_prompt function does not complain...

Hello. Do you have any follow up advice to improve the quality from clones voice samples? This is not only prompt I used, but one example.

I created a prompt using 3 seconds of audio. The generated audio has some problems. https://www.dropbox.com/scl/fi/ewj55pxg9lpgtsf7e6ie7/barackobamafederalplaza_3s.wav?rlkey=py7e9vd88r3fxdl4m4nxok2ii&dl=0 https://www.dropbox.com/scl/fi/kcqn4zqmr8en5a9eqdo30/obama_2_nose.wav?rlkey=cgzcx79u89xbydbsmmofnhjrg&dl=0