tensorflow_wavenet_vocoder
tensorflow_wavenet_vocoder copied to clipboard
there are some question about my implementation
Hi azraelkuan, thanks for your work and sharing! I encounter three questions during implementation.
my environment: win10, python==3.6.7, tensorflow==1.11, anaconda==3
one is when I finished "preprocess.py", my file(LJSpeech-1-mel.npy) was only 176KB and file(LJSpeech-1-audio.npy) was only 281KB. I think it may be that something is repeatedly covered or the problem is caused by the difference between windows and Linux? i am not sure about this.
The second problem is encountered during the Synthesize step. I didn't find a file called "eavl.txt". There are only three files (LJSpeech-1-audio.npy, LJSpeech-1-mel.npy, train.txt) in output path after preprocess.
the final question is it seems that the input of the parameter is adjusted, unlike the command in the readme? about '--eval_txt' i just set the output folder for preprocess.
tensorflow_wavenet_vocoder>python mul_generate.py --eval_txt ./FeaPath/ --wav_out_path ./WavOut/ checkpoint ./log_ljspeech/train/2018-11-18T18-07-48/model.ckpt-99999 ---hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050 usage: mul_generate.py [-h] [--logdir LOGDIR] [--temperature TEMPERATURE] [--save_every SAVE_EVERY] [--eval_txt EVAL_TXT] [--hparams HPARAMS] checkpoint mul_generate.py: error: unrecognized arguments: --wav_out_path checkpoint ./log_ljspeech/train/2018-11-18T18-07-48/model.ckpt-99999 ---hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050
or this code can't running on windows? Tell me if I'm wrong, thanks ^_^
the eval_txt is extracted from the train_txt。。。。。。i see