SpeechSplit
Will this also work on unseen data? Will it be able to convert the voice of an unseen speaker whose content differs from the training data, and will we still obtain the disentanglement?
I have the same doubt. Can someone please clarify? Thanks in advance.
I have the same question.
You can make it generalize to unseen speakers by training it the same way as AutoVC.
@auspicious3000 could you explain what you mean by "training it the same way as AutoVC"?
Do we repeat all the steps from https://github.com/auspicious3000/autovc#2train-model ?
Or do we change make_metadata.py in SpeechSplit to embed speaker encodings, but train using the model from SpeechSplit?
@skol101 it means training with generalized speaker embeddings instead of one-hot embeddings.
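For readers who land here, a minimal sketch of the difference between the two embedding types. The toy encoder, feature shapes, and function names below are illustrative assumptions, not code from either repo:

```python
import numpy as np

def one_hot_embedding(speaker_index, num_speakers):
    """One-hot speaker embedding: a fixed identity slot per training speaker,
    so it cannot represent speakers unseen during training."""
    emb = np.zeros(num_speakers, dtype=np.float32)
    emb[speaker_index] = 1.0
    return emb

def generalized_embedding(utterance_features, encoder):
    """d-vector style embedding: average a speaker encoder's outputs over
    several utterances, then L2-normalize. Unseen speakers map into the
    same continuous space, which is what lets the model generalize."""
    embs = np.stack([encoder(f) for f in utterance_features])
    mean = embs.mean(axis=0)
    return mean / np.linalg.norm(mean)

# Toy "encoder" (mean mel frame) standing in for a trained speaker encoder.
toy_encoder = lambda mel: mel.mean(axis=0)
```

With a one-hot table, an unseen speaker has no row to look up; with the generalized embedding, you just run the speaker encoder on a few of their utterances.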
I did that -- I used make_metadata.py from AutoVC. I have now removed the validation part from solver.py in this repo, because there is nothing to validate against (as in solver_encoder.py in AutoVC), and started training.
Am I doing this correctly? Your help is much appreciated.
Sounds correct, but you don't need to remove the validation part.
I had to remove the validation part because I haven't yet figured out how to create my validation pkl file based on VCTK plus my custom voices.
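The exact pkl layout this repo's solver expects isn't spelled out in the thread; assuming each entry follows the AutoVC-style layout of [speaker_id, speaker_embedding, *mel_file_paths], a train/validation split over VCTK plus custom voices could be sketched like this (build_metadata, toy_embed, and the holdout scheme are all assumptions):

```python
import pickle
import numpy as np

def build_metadata(speakers, embed_fn, holdout=2):
    """Split each speaker's utterances into train/validation metadata.

    speakers: dict mapping speaker_id -> list of mel-spectrogram file paths.
    embed_fn: callable returning a speaker embedding from those paths.
    holdout:  utterances per speaker reserved for validation.

    The [speaker_id, embedding, *paths] entry layout follows AutoVC's
    make_metadata.py and has not been verified against SpeechSplit.
    """
    train, val = [], []
    for spk, paths in speakers.items():
        emb = embed_fn(paths)
        if len(paths) > holdout:
            train.append([spk, emb, *paths[:-holdout]])
            val.append([spk, emb, *paths[-holdout:]])
        else:
            # Too few utterances to hold any out; keep them all for training.
            train.append([spk, emb, *paths])
    return train, val

# Toy embedding: a deterministic random vector per speaker, standing in
# for a real speaker encoder run over the utterances.
toy_embed = lambda paths: np.random.default_rng(
    abs(hash(paths[0])) % 2**32).random(256)
```

The two lists would then be written with `pickle.dump(train, open('train.pkl', 'wb'))` and likewise for the validation list, so the solver's validation step has something to load.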