Aaron (Yinghao) Li

Results 110 comments of Aaron (Yinghao) Li

@CONGLUONG12 Probably yes, if speaker A has samples in the training set with similar emotions, otherwise it might not work.

I don't understand your question. What do you mean by does not match the actual sound?

This is insanely weird. I have tried to train it by multiplying the phase by torch.pi, but it fails to converge, while using the range from -1 to 1 works...

Thanks for fixing the requirements. However, your Colab notebook doesn't really work because you didn't actually download the pre-trained models. Instead, you copied them from your Google Drive. You can...

I did try training for other languages including Mandarin, Japanese, Hindi etc., though it requires a few changes: 1. You need to phonemize Chinese into IPAs. You can use either...

For Japanese, you can do the same thing: The conversion table from kana to IPA is the following (again phonemizer doesn't work for me). ```python kana_mapper = OrderedDict([ ("ゔぁ","bˈa"), ("ゔぃ","bˈi"),...

@c9412600 That was a typo that should not be included, I have fixed it. 155 is the speaker id (never used during training, just for clarification), and X means no...

@CONGLUONG12 I don't think there is any change needed for Vietnamese. You only need to find a conversion table between chu quoc ngu and IPA (maybe phonemizer works for this...

@yihuitang You need to code it yourself because the meldataset.py was written for English support only. I have provided the conversion table, so it should not be difficult for you...