DiffGAN-TTS issues

Results 19 DiffGAN-TTS issues

Sort by recently updated

License Issue

Hi @keonlee9420, this software depends on [`praat-parselmouth`](https://github.com/YannickJadoul/Parselmouth) which is GPL-licensed, which means all software that depends on it must also be GPL-licensed. Might it be possible to switch to [DeepPhonemizer](https://github.com/as-ideas/DeepPhonemizer),...

fakerybakery

FileNotFoundError: [Errno 2] No such file or directory: './preprocessed_data/VCTK/spker_embed/p225-spker_embed.npy'

When I am trying to run inference with the VCTK dataset, I am getting this error. Do we need to prepare speaker embeddings ourselves even with pre-trained VCTK models? Thanks!...

liwangd

about the preprocessed data VCTK

I encountered some problems again VCTK dataset, I followed the process but UnboundLocalError: local variable 'f0' referenced before assignment, I wonder if it is possible to package the VCTK dataset...

yyh565655555

How many days taken to train DiffGAN-TTS

Hi Keonlee, May I know how many days taken you to train Diff GAN-TTS and please give information regarding GPU specification.

Mani0418

Checkpoints for Mandarin

Dear Keon Lee, I am a research assistant at the City University of Hong Kong, I currently conduct research related to neurolinguistics and appreciate your work about text to speech...

HLSUD

process data

hi, when i use VCTK dataset, process has a problem called "UnboundLocalError: local variable 'f0' referenced before assignment" but using LJSpeech is ok. By the way ,when i train the...

yyh565655555

Implementation performance

Hi, thank you very much for your great work! I was wondering if you conduct any evaluations on the model performance and voice quality for multi-speaker results, e.g. MOS or...

LeyuanQu

Some of the problems that occur in training

Hi@keonlee9420, I encountered some problems during the training stage. I often have loss functions that occasionally fluctuate a lot during training, even from around 3 to tens or hundreds. After...

Dyongh613

Why minmize l1(\hat{x_0}, x_0)+l1(\hat{x_1}, x_0) when optimizing aux model?

Hi, keonlee. Thanks for sharing code! I found that when training aux model, we get \hat{x_0} from G, then diffuse it to \hat{x_1}, finally get a prediciton list [ \hat{x_0},...

caisikai

DiffGAN-TTS
DiffGAN-TTS copied to clipboard

Metadata

License Issue

FileNotFoundError: [Errno 2] No such file or directory: './preprocessed_data/VCTK/spker_embed/p225-spker_embed.npy'

about the preprocessed data VCTK

How many days taken to train DiffGAN-TTS

Checkpoints for Mandarin

process data

Implementation performance

Some of the problems that occur in training

Why minmize l1(\hat{x_0}, x_0)+l1(\hat{x_1}, x_0) when optimizing aux model?

← Metadata

Owner

Metadata

DiffGAN-TTS DiffGAN-TTS copied to clipboard

Metadata

← Metadata

Owner

Metadata

DiffGAN-TTS
DiffGAN-TTS copied to clipboard