tacotron issues

Sorry , I cannot reprocess the dataset,using thchs I cannot solve this problem.

Traceback (most recent call last): File "C:\Users\72970\Anaconda3\envs\tensorflow\lib\concurrent\futures\process.py", line 175, in _process_worker r = call_item.fn(*call_item.args, **call_item.kwargs) File "C:\Users\72970\Desktop\1\datasets\thchs30.py", line 74, in _process_utterance mel_spectrogram = audio.melspectrogram(wav).astype(np.float32) File "C:\Users\72970\Desktop\1\util\audio.py", line 66, in melspectrogram...

Creiphyn

Audio generated using eval is not same as generated by demo_server for the same checkpoint

1

I have tried training on Emotion Dataset which have multiple emotions and same text. So while training at every checkpoint it generates an audio file using some text (dont know...

prateekgupta891

Non-English Data (Bengali)

I'm working for the Bengali language and using `cleaners=transliteration_cleaners`. But most of the cases it gives wrong phoneme representation for Bengali. Can I use **IPA** [1]? [1] https://en.wikipedia.org/wiki/International_Phonetic_Alphabet

Rajan-sust

ssml support?

Hello everyone! Do you guys have any plans for SSML support for the Tacotron? It would be cool to be able to set specific pause duration, stress-positions and so on!

vcjob

Explanation of the decoder and audio sample rate

1

Hello! Thank you for the sharing your implementation of the tacotron. Your code is well documented, but I still can't completely figure out some things. 1. Could you (or somebody...

morelen17

Synthesis time is high on a Jetson TX2 GPU

2

@keithito I was able to successfully run the model using the latest implementation from this project on a NVIDIA Jetson TX2 HW with GPU support using CUDA. I used the...

sranjeet81

Issues training with Cantonese language

I've updated the `symbols.py` for training Cantonese. I'm using Jyutping representation of transcripts which English characters with numbers(1-6). Jyutping representation ```text wong5 si6 faat3 sang1 zoi6 cat1 sap6 ng5 nin4...

mirfan899

Failed on demo_server.py

2

Please help, My GPU is RTX 2070 8GB and I'm running using TF 1.12 with CUDA 9.0 and CuDNN 7.5. When I run the demo_server.py, I got the following error...

ronykalfarisi

audio quality and long sentences issues

3

hello, my own model in native language was trained based on default hparams.py parameters and LJSpeech dataset standards, but need some improvements: 1. the generated sample has low audio quality...

ramilrg

When I iterate 13,000 times, why is the synthesized speech a piece of silence

4

Text2-m

tacotron
tacotron copied to clipboard

Metadata

Sorry , I cannot reprocess the dataset,using thchs I cannot solve this problem.

Audio generated using eval is not same as generated by demo_server for the same checkpoint

Non-English Data (Bengali)

ssml support?

Explanation of the decoder and audio sample rate

Synthesis time is high on a Jetson TX2 GPU

Issues training with Cantonese language

Failed on demo_server.py

audio quality and long sentences issues

When I iterate 13,000 times, why is the synthesized speech a piece of silence

← Metadata

Owner

Metadata

tacotron tacotron copied to clipboard

Metadata

← Metadata

Owner

Metadata

tacotron
tacotron copied to clipboard