DiffGAN-TTS icon indicating copy to clipboard operation
DiffGAN-TTS copied to clipboard

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Results 19 DiffGAN-TTS issues
Sort by recently updated
recently updated
newest added

I realise that when I remove adversarial loss and feature match loss, it still works well and has no degeneration of performance. This makes me question the role of adversarial...

HI@keonlee9420, I have some questions to ask you about the mel-spectrogram. In the picture, ![image](https://user-images.githubusercontent.com/94910118/176336644-a71a4bae-117b-4557-9dfb-ec8b32ebe3f1.png) The above mel-spectrogram alignment has been generated, but the horizontal details have not been released...

Hi Keon, thanks so much for sharing this wonderful project. I am wondering can we just use the FastSpeech part for inference? Looking forward to your reply

Hello, sorry for bothering you. Have you contacted with DiffSVC? I saw a code for DiffSVC is similar with yours but it is uncompleted.

I will encounter problems when training to validation, which is 1000 steps Traceback (most recent call last):███████████████████████████████████████████████████████████████████████████| 99/99 [12:35

File "train.py", line 320, in 3.24s/it] main(args, configs) File "train.py", line 196, in main figs, wav_reconstruction, wav_prediction, tag = synth_one_sample( File "/data/workspace/liukaiyang/TTS/DiffGAN-TTS-main/utils/tools.py", line 227, in synth_one_sample mels = [mel_pred[0, :mel_len].float().detach().transpose(0,...

Hello @keonlee9420, I've been working with the VCTK pretrained model provided in the GitHub repository and encountered some issues regarding the audio quality for longer text inputs. While the initial...

Hello @keonlee9420 I wanted to express how much I admire your beautiful open-source code. I was wondering if you could kindly provide some code for objective performance evaluation. Your assistance...

Hello! Could you update the docs with info about using python==3.8 making things easier (praat-parselmouth only distributes binaries for python 3.8 and I couldn't get it to compile), numba==0.49 (numba.decorators...