cross_vc icon indicating copy to clipboard operation
cross_vc copied to clipboard

what have you modify in this repo to support cross language ?

Open emmacirl opened this issue 6 years ago • 7 comments

@Kyubyong Thanks for your contribution! I found that the ideas that net1 for recognition and net2 for synthesis is similar with 'deep voice conversion'. So I am wondering what have you modify to support cross language? Looking forward to your reply

emmacirl avatar Mar 19 '18 12:03 emmacirl

Actually nothing but one: replacing single phonemes with triphones. I know it's not enough for this task, so I'm looking for a better architecture/model.

Kyubyong avatar Mar 19 '18 12:03 Kyubyong

@Kyubyong Thanks a lot for your fast reply! So in order to support cross language different language database is applied to train net2, is it ?

emmacirl avatar Mar 19 '18 12:03 emmacirl

No for training always English speech samples are uswd. For inference different languages are to be tested.

Kyubyong avatar Mar 19 '18 12:03 Kyubyong

The TIMIT is labeled with triphones, and Net2 is trained with English. Finally voice with any language can be synthesized?

emmacirl avatar Mar 19 '18 12:03 emmacirl

Exactly.

Kyubyong avatar Mar 19 '18 12:03 Kyubyong

Got it !

emmacirl avatar Mar 19 '18 12:03 emmacirl

Thank you for your contribution. How long did it take you to train Net1? thank U Looking forward to your reply!

lightwithshadow avatar Oct 29 '18 14:10 lightwithshadow