DiffGAN-TTS issues

Results 19 DiffGAN-TTS issues

Sort by recently updated

Is adversarial training actually necessary?

I realise that when I remove adversarial loss and feature match loss, it still works well and has no degeneration of performance. This makes me question the role of adversarial...

nguyenhungquang

How to adopt these codes to another non English language

supernirmesh

Can I ask you some questions about mel-spectrogram?

HI@keonlee9420, I have some questions to ask you about the mel-spectrogram. In the picture, ![image](https://user-images.githubusercontent.com/94910118/176336644-a71a4bae-117b-4557-9dfb-ec8b32ebe3f1.png) The above mel-spectrogram alignment has been generated, but the horizontal details have not been released...

Dyongh613

Can we just use FastSpeech for inference as baseline result

Hi Keon, thanks so much for sharing this wonderful project. I am wondering can we just use the FastSpeech part for inference? Looking forward to your reply

Maoshuiyang

About DiffSVC

Hello, sorry for bothering you. Have you contacted with DiffSVC? I saw a code for DiffSVC is similar with yours but it is uncompleted.

guoyingying432

On Input Output Convolutional Mismatch during Training

I will encounter problems when training to validation, which is 1000 steps Traceback (most recent call last):███████████████████████████████████████████████████████████████████████████| 99/99 [12:35

wangxuanji

'GaussianDiffusion' object has no attribute 'cond' when training with multi-GPU

File "train.py", line 320, in 3.24s/it] main(args, configs) File "train.py", line 196, in main figs, wav_reconstruction, wav_prediction, tag = synth_one_sample( File "/data/workspace/liukaiyang/TTS/DiffGAN-TTS-main/utils/tools.py", line 227, in synth_one_sample mels = [mel_pred[0, :mel_len].float().detach().transpose(0,...

WillQuCD

DiffGAN-TTS
DiffGAN-TTS copied to clipboard

Metadata

Is adversarial training actually necessary?

How to adopt these codes to another non English language

Can I ask you some questions about mel-spectrogram?

Can we just use FastSpeech for inference as baseline result

About DiffSVC

On Input Output Convolutional Mismatch during Training

'GaussianDiffusion' object has no attribute 'cond' when training with multi-GPU

Issues with Audio Quality for Longer Text Inputs Using VCTK Pretrained Model

Seeking Help with Objective Performance Evaluation Code

Python and dependency versions

← Metadata

Owner

Metadata

DiffGAN-TTS DiffGAN-TTS copied to clipboard

Metadata

← Metadata

Owner

Metadata

DiffGAN-TTS
DiffGAN-TTS copied to clipboard