Kyubyong Park comments

Results 79 comments of


                                            Kyubyong Park

raising the predicted magnitudes by a power of 1.2, not input magnitudes

Okay, let me give you an example. Correct me if I'm wrong. Assume a value of an element of the original magnitude is 2. The best model should output 2,...

raising the predicted magnitudes by a power of 1.2, not input magnitudes

Actually, I've changed the relevant codes. See https://github.com/Kyubyong/tacotron/blob/master/utils.py#L42 and https://github.com/Kyubyong/tacotron/blob/master/eval.py#L65 I think this revision is closer to the paper. Thanks, guys!

better reuslut

Would you share your code?

Other language

Absolutely. I'm doing that for 10 non-English languages. So far so good.

no batch-norm for conv1d in encoder

If you see Table 1 on page 4 of the paper, the second layer of the Conv1D projections is described as `conv-3-128-Linear`. If we don't apply activation, we shouldn't normalize,...

no batch-norm for conv1d in encoder

@candlewill Thanks for your question. Well, what I meant was we shouldnt apply activation or normalization before the final layer because usually we are to yield logits. In this case,...

How many epochs?

The fatc that 3/5 of the generated file is silent looks fine, because we intended to reconstruct them (zero paddings). The training curve looks good, too. When I was training...

How many epochs?

Some human-like voice is heard, though I can't recognize what he(?)'s saying about. (I think it's natural because the data is far from enough) I've recently revised the code. When...

How many epochs?

@xuerq I'm running a sanity-check test. I'll share with you as soon as it's done.

Empty generated waves

Can you share your training loss graph?