nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
The code is full of pitfalls
Hi Jing-Xuan, thanks for open-sourcing this! If you included some pre-trained models (maybe hosted on Git LFS), it would be very useful for the research and open-source community. -josh
Hi guys, first of all, thanks for sharing this great research. I have a question: is it possible to offer a Python "code" way to train / fine-tune the model?...
Hi, I wanted to modify the speaking rate of the voice conversion speech. As mentioned in your paper on forward attention, I was looking for the transition agent to modify the...
Hi @jxzhanggg, I am trying to achieve Voice Conversion with this algorithm applied to prosody training. This means that I want to convert a reference audio (Speaker A) to the...
Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.14 to 2.7.2. Release notes (sourced from tensorflow's releases): TensorFlow 2.7.2. This release introduces several vulnerability fixes: Fixes a code injection in saved_model_cli (CVE-2022-29216); Fixes...
Given the lack of comments in run.sh and the absence of a README file in the fine-tuning folder, it is unclear how to set up the Arctic data or other voice-data...
Hello, could you please specify the steps to enable multi-GPU training? I set `distributed_run=True` in `hparams.py` and then set `--n_gpus=2` and `CUDA_VISIBLE_DEVICES=0,3` in file `run.sh` to select GPUs 0...
Hi Zhang, could you please explain how the text encoder output and the recognition encoder output align? It is stated in your paper as "The recognition encoder Er is a seq2seq...
Hello @jxzhanggg, First of all, thank you for your helpful replies to the previous issues I posted. I would like to adapt this voice conversion model to European Portuguese. The...