Arthur Wu

Results 73 comments of Arthur Wu

Hi @kan-bayashi, I have done everything as your guide describes, but the result is the same as with the original model. 1. Use the csmsc PWG conf to train the base model...

Sorry, I've now granted permission on the second sample; you can try opening it again. "Did you perform adaptation of Text2Mel model as well?" Do you mean the TTS model, like this?...

Text2Mel adaptation was my question; I will try it, thanks.

I got an error when following the jvs guide: `tts_train.py: error: unrecognized arguments: --init_param /data/espnet/egs2/chinese_man/tts1/exp/tts_train_raw_phn_pypinyin_g2p_phone/200epoch.pth` but using the `--pretrain_path` parameter like this solved it: `--pretrain_path /data/espnet/egs2/chinese_man/tts1/exp/tts_train_raw_phn_pypinyin_g2p_phone/200epoch.pth --pretrain_key null` may you want...

I have done everything, and I get the correct result: `https://drive.google.com/file/d/1GhwP68UJ1DzyMbCfExHTDvYhRM7tFQD5/view?usp=sharing` Thanks a lot. How can I create a style-transfer voice? How should I choose a model? Any suggestions for doing this with espnet?

An early paper: https://arxiv.org/pdf/1710.11385.pdf and code: https://github.com/inzva/Audio-Style-Transfer

OK, thanks a lot.

[2023-06-26 17:04:13,047] [INFO] [logging.py:96:log_dist] [Rank 0] Using DeepSpeed Optimizer param name adamw as basic optimizer [2023-06-26 17:04:13,057] [INFO] [logging.py:96:log_dist] [Rank 0] DeepSpeed Basic Optimizer = DeepSpeedCPUAdam [2023-06-26 17:04:13,057] [INFO] [utils.py:54:is_zero_supported_optimizer]...

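Even with max_len lowered to 512 it still runs out of memory. I suspect a problem in the code itself, or maybe a DeepSpeed (dp) version issue?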