nonparaSeq2seqVC_code issues

Include pre-trained models

4

Hi Jing-Xuan, Thanks for open-sourcing this! If you include some pre-trained models, (maybe hosted on git LFS), it would be very useful for the research / open source community. -josh

JRMeyer

Feature Request

1

Hi Guys, first of all, thanks for sharing this great research. I have a question. Is it possible to offer a python "code" way to train / fine-tune the model?...

ChrisDelClea

Transition agent in forward attention

1

Hi, I wanted to modify the speaking rate of the voice conversion speech. As mentioned in your paper on forward attention, I was looking for transition agent to modify the...

narendranp

Keeping prosodic features of reference Speaker

4

Hi @jxzhanggg, I am trying to achieve Voice Conversion with this algorithm applied to prosody training. This means that I want to convert a reference audio (Speaker A) to the...

jucasansao

Bump tensorflow from 1.14 to 2.7.2

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.14 to 2.7.2. Release notes Sourced from tensorflow's releases. TensorFlow 2.7.2 Release 2.7.2 This releases introduces several vulnerability fixes: Fixes a code injection in saved_model_cli (CVE-2022-29216) Fixes...

dependabot[bot]

dependencies

Fine-tuning help

1

It's unclear from the lack of comments in run.sh and the lack of a read-me file in the fine-tuning folder how to set up the Arctic data or other voice-data...

leonardw86

Multi-GPU training

3

Hello, Could you please specify the steps to enable multi-GPU training, please? I set `distributed_run=True` in `hparams.py` and then set `--n_gpus=2` and `CUDA_VISIBLE_DEVICES=0,3` in file `run.sh` to select GPUs 0...

ivancarapinha

The mechanism of alignment between text encoder output and audio_seq2seq output

1

Hi, Zhang Could you please explain how the text encoder output and recognition encoder output align? it is stated in your paper as "The recognition encoder Er is a seq2seq...

inconnu11

Training the model for a different language

1

Hello @jxzhanggg, First of all, thank you for your helpful replies to the previous issues I posted. I would like to adapt this voice conversion model to European Portuguese. The...

ivancarapinha

nonparaSeq2seqVC_code
nonparaSeq2seqVC_code copied to clipboard

Metadata

代码全是坑

Include pre-trained models

Feature Request

Transition agent in forward attention

Keeping prosodic features of reference Speaker

Bump tensorflow from 1.14 to 2.7.2

Fine-tuning help

Multi-GPU training

The mechanism of alignment between text encoder output and audio_seq2seq output

Training the model for a different language

← Metadata

Owner

Metadata

nonparaSeq2seqVC_code nonparaSeq2seqVC_code copied to clipboard

Metadata

← Metadata

Owner

Metadata

nonparaSeq2seqVC_code
nonparaSeq2seqVC_code copied to clipboard