raccoonML comments

Results 26 comments of


                                            raccoonML

Not utilizing full GPU at training

Something else is bottlenecking performance. Since it's a cloud server the first thing I would suspect is the filesystem. Make sure the dataset is hosted on some kind of fast,...

Synthesizer alignment

It's possible this will be resolved with more training steps. You can also try restarting the training with a higher reduction factor to make it easier to learn attention. Once...

Poor attention with a different speaker encoder

Which speech dataset are you using? You should be using LibriSpeech or LibriTTS if you want to compare results to the pretrained models of this repo.

proposing accessibility changes for the gui

The major benefit of the toolbox is the audio visualizations, in the form of speaker embeds and spectrograms. If you don't need images, a very basic interface could suffice. Maybe...

proposing accessibility changes for the gui

I don't intend to work on this issue, but I suggest that you come up with detailed requirements to help a developer who is interested in solving this problem. What...

proposing accessibility changes for the gui

Does the developer need to do anything special with wxPython to provide that accessibility info to the screen reader? Another way of stating the question is, if a wxPython interface...

proposing accessibility changes for the gui

Is there a way to do this with PyQT so we don't need to rewrite the interface?

How we can slow down the output audio?

A more advanced solution is to save the attention layer alignments from inference, stretch them by the desired slowdown amount, then run the decoder loop again replacing the attention network...

Improve fidelity?

It can be improved by training a new vocoder model from scratch on higher quality data. You can preprocess the dataset at a higher sampling rate, and the vocoder will...

PermissionError: [Errno 13] Permission denied: 'synthesizer\\saved_models\\logs-pretrained\\taco_pretrained'

Are you using the latest code? That error message pertains to checkpoints developed for an older version of this repo, which used tensorflow.