glow-tts icon indicating copy to clipboard operation
glow-tts copied to clipboard

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Results 48 glow-tts issues
Sort by recently updated
recently updated
newest added

빠른 TTS를 위한 좋은 논문을 내주셔서 감사합니다. 제목에 해당하는 부분을 먼저 말씀드리자면, (944525a commit) train.py의 127번째 line 에서 logger.info에서 진행도를 계산하는 부분에서 gpu 개수가 고려가 되어 있지 않습니다. 해당 부분:...

Hi Jaehyeon, I modified your code to train multiple speakers and it seems to be training and inferring pretty well. Thanks for leaving the code in a state that makes...

Dear contributors, Thank you for sharing your great works. I have successfully reproduced your result with the LJSpeech Dataset. In addition, I have trained your model with Korean Single Speaker...

Hi @jaywalnut310. I took your `inference_hifigan.ipynb` notebook and made it fully runnable inside Google Colab (here's my [Colab Gist](https://colab.research.google.com/gist/sayakpaul/bab1ce7286b70583a7cab0d6a9a4ceaf/inference_hifigan.ipynb)). I think it would make it easier for people to play...

Hi, thanks for this great repo! I am trying to run this repo with nvidia rtx 3090 and cuda 11.2. I have this error whole day. I was trying different...

Hi, I trained the model with my own dataset. However, the duration loss increases gradually after a period of time for val data, it's overfitting, right? Can I stop the...

@jaywalnut310 : can you plz comment on adapting this model for zero shot or few shot tts voice conversion

I can see that we can control epsilon (noise_scale) and duration (length_scale), but I can't find in the code where we can control the temperature.

got this error: (Hifigan384test) D:\Coding\PYFastCache\PYVenv\Hifigan384test\glow-tts-master\monotonic_align>python setup.py build_ext --inplace running build_ext building 'monotonic_align.core' extension error: Unable to find vcvarsall.bat Yes,i know it is probably because cpython or msvc. and Yes,I had...

I used Glow-TTS and Multi-band MelGAN to create Korean TTS using my voice as a dataset. And the sample audio of the result can be found at the bottom of...