VQMIVC
VQMIVC copied to clipboard
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Hi, I wonder why you normalize f0 series before feeding to the f0encoder in convert.py. However, this kind of normalization for f0 isn't used in preprocessing phase.
@andreasjansson @Wendison Hello, sorry to interrupt you! I'm a rookie of voice model. I have trained the model in VCTK-Corpus-0.92.zip dataset by "python3 train.py use_CSMI=True use_CPMI=True use_PSMI=True" in NVIDIA V100S....
Dear PHD: I try to train a vocoder, and I have installed parallelwavegan,and I run the command: run.sh,however it came out with the traceback: Traceback (most recent call last): File...
Dear PHD: I have already install the ParallelWaveGAN,however,when I run the egs/vctk/voc1 run.sh, it comes with the error: Stage 0: Data preparation Successfully split data directory. ERROR: num_first + num_second...
Hi, I try to reproduce your paper and I encounter a problem with mel stats and vocoder. When I use your pre-trained vocoder and mel stats, I can notice the...
What does this mean? Am I doing something wrong?
Hi, @Wendison Thank you so much for your excellent work. very nice paper. When I saw this reply on the below issues, it helped me to motivate to go further....
Dear PHD: Could you tell me what do z_dim:64 and c_dim:256 in config/model/default stand for?And what n_embeddings: 512 in config/model/default stand for?Thank you very much.
Hi @Wendison Thank you so much for this great work. I fine-tuned (resumed) pretrained model (use_CSMI=True use_CPMI=True use_PSMI=True) with indicTTS dataset (20 speakers - each having 1 hour audios) the...