Comprehensive-Transformer-TTS icon indicating copy to clipboard operation
Comprehensive-Transformer-TTS copied to clipboard

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aim...

Results 10 Comprehensive-Transformer-TTS issues
Sort by recently updated
recently updated
newest added

I'm trying to preprocess the VCTK dataset, and stuck on the 'Computing statistic quantities' step. When I copy from repo preprocessed_data files instead, the training run successful. Firstly, there is...

RuntimeError: Expected to have finished reduction in the prior iteration before starting a new one. This error indicates th at your module has parameters that were not used in producing...

Are sent and word duration loss necessary for unsupervised alignment for a robust duration prediction?

I've recently been experimenting with your implementation of DelightfulTTS and the voice quality is awesome. However I found out that the embedding vector output of Utterance-level Prosody extractor is very...

Recently two papers regarding Transformer TTS pops up and I think both are suitable for this repo: 1) [DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021](https://arxiv.org/abs/2110.12612) 2) [Emphasis...

seems packages may have updated names, specific python version appears to be required, requires c++ build tools suggest update: python~3.8.0 and =1.7.0 (==2.0.0)

I think here is a bug in audio/stft.py: 252 `energy = np.sqrt(np.exp(mel) ** 2).sum(-1)` This code did nothing but just sum the abs of the `np.exp(mel)`, while we expect it...

Hi, I am training a model on the ryanspeech dataset. Currently it is on 125k+ steps, and I tried to synthesize a speech with the checkpoint, but the result is...

Hi, I am adding your MDN prosody modeling code segment to my tacotron but I encountered several problems about the code segment about prosody modeling. First, the prosody loss is...

안녕하세요 먼저 코드 공유 감사드립니다. 다름이 아니라 mdn쪽에서 loss가 nan이나 infinty가 뜨길래 cliping을 통해서 학습을 돌리고 결과를 보았을때 소리가 아예 생성이 안되는것 같습니다. 혹시 어떤 문제가 예상되는지 여쭤봐도 될까요? 직접...