hyzhan
I have tried `use_gst=False`, but the result seems to be the same as Tacotron 1: although `refnet_outputs` changes, the generated audio hardly changes with different reference audio.
At what level of `s_error` can we get understandable audio?
@candlewill What is the dataset size for "e2e_lpcnet_samples_share.zip"? It sounds good. My e2e_demo has some noise, and the loss is about 3.35 with the default parameters.
@m-toman @Rayhane-mamah Fine-tuning by swapping out the data seems to produce a voice that differs somewhat from the fine-tuning data. How can I solve this problem if I have not...
@begeekmyfriend @keithito Actually it's not so complicated. Just like this:

```python
from models.attention import LocationSensitiveAttention

attention_mechanism = LocationSensitiveAttention(
    hp.attention_dim,
    encoder_outputs,
    hparams=hp,
    mask_encoder=hp.mask_encoder,
    memory_sequence_length=input_lengths,
    smoothing=hp.smoothing,
    cumulate_weights=hp.cumulative_weights)
```

Then replace the original code in `AttentionWrapper`, `BahdanauAttention(hp.attention_depth, encoder_outputs)`, with...
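To make the swap concrete, here is a minimal, self-contained sketch of the pattern being described: the decoder's `AttentionWrapper` stays the same, and only the attention-mechanism object handed to it changes. The classes below are stand-ins with the constructor signatures mentioned above (the real `LocationSensitiveAttention` lives in `models/attention.py` of the Tacotron-2 repo and requires TensorFlow), so this is illustrative, not the actual implementation.

```python
class BahdanauAttention:
    """Stand-in for the original content-based attention mechanism."""
    def __init__(self, num_units, memory):
        self.num_units = num_units
        self.memory = memory


class LocationSensitiveAttention:
    """Stand-in mirroring the keyword arguments quoted in the comment above."""
    def __init__(self, num_units, memory, hparams=None, mask_encoder=False,
                 memory_sequence_length=None, smoothing=False,
                 cumulate_weights=True):
        self.num_units = num_units
        self.memory = memory
        self.cumulate_weights = cumulate_weights


class AttentionWrapper:
    """Stand-in wrapper: it only records which mechanism it was given."""
    def __init__(self, cell, attention_mechanism):
        self.cell = cell
        self.attention_mechanism = attention_mechanism


encoder_outputs = "encoder_outputs"  # placeholder for the encoder tensor

# Before: wrapper built with Bahdanau (content-based) attention.
wrapper = AttentionWrapper("decoder_cell",
                           BahdanauAttention(128, encoder_outputs))

# After: same wrapper, location-sensitive mechanism swapped in.
wrapper = AttentionWrapper(
    "decoder_cell",
    LocationSensitiveAttention(128, encoder_outputs,
                               mask_encoder=False,
                               memory_sequence_length=None,
                               smoothing=False,
                               cumulate_weights=True))

print(type(wrapper.attention_mechanism).__name__)
```

The point of the sketch is that the change is local: nothing else in the decoder needs to be touched, because `AttentionWrapper` only depends on the mechanism's interface, not its class.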
@jjery2243542 Is it enough to use only the reconstruction loss of the audio, without a classification loss?
> The current version's prosody is weird and I am trying to fix it. Maybe I will train a better version and release the checkpoint this month.

The prosody of the...
Maybe the audio is too short.