Comprehensive-Transformer-TTS issues

An errors with running the preprocess.py

2

I'm trying to preprocess the VCTK dataset, and stuck on the 'Computing statistic quantities' step. When I copy from repo preprocessed_data files instead, the training run successful. Firstly, there is...

dasstyx

Multi-GPU training could not work normally?

1

RuntimeError: Expected to have finished reduction in the prior iteration before starting a new one. This error indicates th at your module has parameters that were not used in producing...

GuangChen2016

Are sent and word duration loss necessary for unsupervised alignment ?

3

Are sent and word duration loss necessary for unsupervised alignment for a robust duration prediction?

xiaoyangnihao

Problem with Utterance-level Prosody extractor of DelightfulTTS

4

I've recently been experimenting with your implementation of DelightfulTTS and the voice quality is awesome. However I found out that the embedding vector output of Utterance-level Prosody extractor is very...

vietvq-vbee

New TTS Model request

19

Recently two papers regarding Transformer TTS pops up and I think both are suitable for this repo: 1) [DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021](https://arxiv.org/abs/2110.12612) 2) [Emphasis...

rishikksh20

requirements fail to install

seems packages may have updated names, specific python version appears to be required, requires c++ build tools suggest update: python~3.8.0 and =1.7.0 (==2.0.0)

chboishabba

bug in calculate the energy in FastSpeechSTFT

I think here is a bug in audio/stft.py: 252 `energy = np.sqrt(np.exp(mel) ** 2).sum(-1)` This code did nothing but just sum the abs of the `np.exp(mel)`, while we expect it...

ease-zh

Gibberish synthesized speech from my own model

Hi, I am training a model on the ryanspeech dataset. Currently it is on 125k+ steps, and I tried to synthesize a speech with the checkpoint, but the result is...

godspirit00

Prosody Loss

7

Hi, I am adding your MDN prosody modeling code segment to my tacotron but I encountered several problems about the code segment about prosody modeling. First, the prosody loss is...

inconnu11

Mixture density network

안녕하세요 먼저 코드 공유 감사드립니다. 다름이 아니라 mdn쪽에서 loss가 nan이나 infinty가 뜨길래 cliping을 통해서 학습을 돌리고 결과를 보았을때 소리가 아예 생성이 안되는것 같습니다. 혹시 어떤 문제가 예상되는지 여쭤봐도 될까요? 직접...

bondio77

Comprehensive-Transformer-TTS
Comprehensive-Transformer-TTS copied to clipboard

Metadata

An errors with running the preprocess.py

Multi-GPU training could not work normally?

Are sent and word duration loss necessary for unsupervised alignment ?

Problem with Utterance-level Prosody extractor of DelightfulTTS

New TTS Model request

requirements fail to install

bug in calculate the energy in FastSpeechSTFT

Gibberish synthesized speech from my own model

Prosody Loss

Mixture density network

← Metadata

Owner

Metadata

Comprehensive-Transformer-TTS Comprehensive-Transformer-TTS copied to clipboard

Metadata

← Metadata

Owner

Metadata

Comprehensive-Transformer-TTS
Comprehensive-Transformer-TTS copied to clipboard