
PyTorch implementation of ByteDance's "Cross-Speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech"

10 Cross-Speaker-Emotion-Transfer issues

Hello author, firstly, thank you for providing this repo; it is really nice. I have a question: 1. I downloaded the CMU data for a single speaker with 100 audios and...

corpus_path: "output/ckpt/RAVDESS"
raw_path: "output/ckpt/RAVDESS/450000.pth/data"

I'm trying to run `synthesize` with the pretrained model, like so: ```bash python3 synthesize.py --text "This sentence is a test" --speaker_id Actor_01 --emotion_id neutral --restore_step 450000 --dataset RAVDESS --mode single...

Hi, I am facing the following issue while synthesizing with the pretrained model. Removing weight norm... Traceback (most recent call last): File "synthesize.py", line 234, in )) if load_spker_embed else None...
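The truncated traceback ends at a line that loads the speaker embedding only `if load_spker_embed`; a common cause (an assumption here, not confirmed by the snippet) is a missing speaker-embedding `.npy` file. A defensive version of that load, using a hypothetical helper name, might look like:

```python
import os
import numpy as np

def load_speaker_embedding(path, load_spker_embed=True):
    """Return the speaker embedding array if requested and present, else None.

    Raises a clear error when the .npy file was never downloaded,
    instead of failing deep inside synthesis.
    """
    if not load_spker_embed:
        return None
    if not os.path.isfile(path):
        raise FileNotFoundError(
            f"Speaker embedding not found: {path} "
            "(download the .npy files or disable load_spker_embed)")
    return np.load(path)
```

This keeps the original `if load_spker_embed else None` behavior but surfaces the missing-file case explicitly.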

This project is great. How can I train on Mandarin? It seems that Mandarin is not supported, and there is no Mandarin-specific processing in the code.

I am curious as to why you used HiFi-GAN or MelGAN rather than the vocoder (WaveRNN) described in the paper. Hello, thank you for sharing the code. In this code, regarding what is described in the paper...

I downloaded the model from https://drive.google.com/drive/folders/1QszdJC7dzBrQHntiLxYcG8ewczvoK4q1 and tested inference with the command below: python3 synthesize.py --text "Hello!" --speaker_id Actor_22 --emotion_id sad --restore_step 450000 --mode single --dataset RAVDESS. The output audio is obviously...

The current implementation is not trained in a semi-supervised way due to the small dataset size, but this can easily be activated by specifying target speakers and passing no emotion...
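The semi-supervised setup described above can be sketched as a masked classification loss: utterances with emotion labels contribute a cross-entropy term, while unlabeled target-speaker utterances are excluded from it. A minimal NumPy sketch (an illustration of the idea, not the repo's actual training code):

```python
import numpy as np

def masked_emotion_loss(logits, emotion_ids, has_label):
    """Cross-entropy computed only over rows whose emotion label is known.

    logits:      (B, n_emotions) raw scores
    emotion_ids: (B,) int labels (dummy values for unlabeled rows)
    has_label:   (B,) bool mask; False marks unlabeled target-speaker rows
    """
    if not has_label.any():
        return 0.0  # batch contains only unlabeled target-speaker data
    z = logits[has_label]
    ids = emotion_ids[has_label]
    z = z - z.max(axis=1, keepdims=True)  # numerically stable log-softmax
    log_probs = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(ids)), ids].mean())
```

Changing the logits of an unlabeled row leaves the loss unchanged, which is exactly how "passing no emotion label" deactivates emotion supervision for the target speakers.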

Hi, thank you for open-sourcing this wonderful work! I followed your instructions: 1) install `lightconv_cuda`, 2) download the [checkpoint](https://drive.google.com/drive/folders/1QszdJC7dzBrQHntiLxYcG8ewczvoK4q1), 3) download the [speaker embedding npy](https://drive.google.com/drive/folders/1a4YW2UWdlF9RTqG_phv_VbRjyEcAld7t). However, the generated...

I am trying to train on Korean data. Would it be possible to share a version of the code that includes Korean preprocessing?