Real-Time-Voice-Cloning
Quality of generated audio
When using a recording from the downloaded LibriSpeech dataset, most of the generated audio clips sound good and accurate. However, whenever I record some audio myself and use that, no matter who the speaker is, all of the generated clips sound the same. Is there any way I can fix this, or am I misunderstanding how to use this tool? I've seen others on YouTube using the tool the same way I am, and their resulting audio clips sound far better than mine.
Can you provide an example? It may be the quality of your recordings or your microphone setup, such as the distance from the speaker or the recording environment.
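As a quick way to rule out obvious recording problems, here is a minimal sketch (assuming librosa and numpy are installed; the file path is a placeholder) that loads a recording as 16 kHz mono, trims silence, and flags clips that are very short or clipping:

```python
# Quick sanity check of a reference recording before cloning.
import librosa
import numpy as np

wav_path = "my_recording.wav"  # replace with your own file

# The speaker encoder in this repo works on 16 kHz mono audio, so load it that way.
wav, sr = librosa.load(wav_path, sr=16000, mono=True)

# Trim leading/trailing silence, which otherwise dilutes the speaker embedding.
wav, _ = librosa.effects.trim(wav, top_db=30)

duration = len(wav) / sr
peak = np.max(np.abs(wav))

print(f"duration after trimming: {duration:.1f} s")
print(f"peak amplitude: {peak:.2f}")

if duration < 5:
    print("Recording is quite short; 5-10 s of clean speech usually works better.")
if peak > 0.99:
    print("Recording appears to be clipping; lower the input gain and re-record.")
```

A short, noisy, or clipped reference clip tends to produce generic-sounding embeddings, which would explain all outputs sounding the same.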
Use single-speaker fine-tuning as described in #437.
#437 has a Dropbox link that no longer exists, so it's hard to reproduce.