Kyubyong Park
@candlewill Did you find out why the multi-GPU version is slower than the single-GPU one? For me, the former is definitely way faster than the latter.
I don't know, honestly. Does the original paper mention anything about it?
@msobhan69 Thanks. I believe what the paper said is true, but I don't know if it means Tacotron can generate samples real-time.
I apologise for this, guys. I don't know why the Dropbox links stopped working, but anyway, I've created new links. Check them out.
Preprocessing might be a solution. Save the inputs to disk as numpy arrays, or adjust the hyperparameters.
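A minimal sketch of that caching idea, assuming a hypothetical `compute_feature` stand-in for the real STFT pipeline (this is not the repo's actual preprocessing code): compute the features once, save them with `np.save`, and have the training loop just `np.load` them.

```python
import os
import tempfile

import numpy as np

def compute_feature(signal, n_fft=512):
    # Stand-in for the real feature extraction: magnitude of an FFT.
    # The point is only that this step is expensive and deterministic,
    # so it should run once, not on every training step.
    return np.abs(np.fft.rfft(signal, n=n_fft)).astype(np.float32)

out_dir = tempfile.mkdtemp()
signal = np.sin(np.linspace(0.0, 100.0, 16000))       # fake audio clip
feat = compute_feature(signal)
np.save(os.path.join(out_dir, "clip0.npy"), feat)     # cache to disk

# Training time: a cheap load instead of recomputation.
loaded = np.load(os.path.join(out_dir, "clip0.npy"))
assert np.allclose(feat, loaded)
```

The same pattern works for any per-utterance feature (spectrograms, mel spectrograms): the disk read is far cheaper than redoing the signal processing inside the input pipeline.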
You guys are right. I've changed it. Thanks.
You're right candlewill. But I don't see any particular reason why we should make things complicated, so I'll just change the output units of the second conv1d layer to 128...
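For what "changing the output units to 128" amounts to, here is a shape-level sketch of a 1-D convolution in plain numpy (illustrative only; the repo uses TensorFlow, and the layer sizes here are just the ones under discussion): setting the second conv1d's output units to 128 means its weight tensor has 128 output channels, so its output is `(time, 128)`.

```python
import numpy as np

def conv1d(x, w):
    # x: (T, C_in), w: (k, C_in, C_out); 'same' padding, stride 1.
    k = w.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, k - 1 - pad), (0, 0)))
    # For each time step, multiply the k-wide window against the kernel
    # and sum over width and input channels, leaving C_out values.
    return np.stack(
        [(xp[t:t + k, :, None] * w).sum(axis=(0, 1)) for t in range(x.shape[0])]
    )

x = np.random.randn(50, 256).astype(np.float32)    # (time, in_channels)
w = np.random.randn(3, 256, 128).astype(np.float32)  # 128 output units
y = conv1d(x, w)
assert y.shape == (50, 128)
```

So the change is purely about the channel dimension of one layer; nothing downstream needs to care beyond expecting 128-channel input.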
@ggsonic Nice work! If you could share the training time or curves as well as your modified code, it would be appreciated. Plus, instance normalization instead of batch normalization... interesting. Is anyone...
@ggsonic Thanks! I guess you're right. I've changed the `reduce_frames` and adjusted the other relevant parts.
Technically speaking, the mel scale is not exactly the same as log. See https://en.wikipedia.org/wiki/Mel_scale. The paper says they use a mel spectrogram and a linear-scale log-magnitude spectrogram. So `spectrogram2wav` converts the predicted magnitude...
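To make the mel-vs-log distinction concrete, here is the standard O'Shaughnessy conversion from the linked Wikipedia page (function names are mine, not from the repo): mel is a *warped* frequency axis, `m = 2595 * log10(1 + f/700)`, which is roughly linear below ~1 kHz and only becomes log-like at high frequencies, so it is not the same as taking the log of the magnitude.

```python
import numpy as np

def hz_to_mel(f):
    # O'Shaughnessy formula: m = 2595 * log10(1 + f / 700)
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    # Exact inverse of hz_to_mel.
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

# The scale is anchored so that 1000 Hz maps to roughly 1000 mel,
# and the conversion round-trips exactly.
assert abs(hz_to_mel(1000.0) - 1000.0) < 2.0
assert abs(mel_to_hz(hz_to_mel(440.0)) - 440.0) < 1e-6
```

Note the warp applies to the frequency axis of the spectrogram; the log in "log-magnitude" applies to the amplitude values. The two are independent choices.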