Liujingxiu23

Results 46 comments of Liujingxiu23

Yes, the problem still exists, and when I run “pip install python-rtmidi”, the installation fails.

How about training the encoder/speaker_verification model on English multi-speaker datasets but training the synthesizer on a Chinese database, assuming there is enough data for each model separately?

@CorentinJ Thank you for your reply. Maybe I should find some Chinese ASR datasets to train the speaker verification model.

@magneter I have not trained the Chinese model. I don't have enough data to train the speaker verification model, so I am trying to collect suitable data now.

Using gcc 4.9 may solve the problem.

@malradhi How do the converted wavs sound? My generated wavs sound just like noise.

Have you tried FastSpeech combined with MelGAN? How are the results?

@Wenqikry Have you tried https://github.com/Rayhane-mamah/Tacotron-2 or https://github.com/NVIDIA/tacotron2 to train log-mels and combined them with MelGAN? Do you have any experience with this? I used https://github.com/Rayhane-mamah/Tacotron-2 , changed the features to log-mel as in this repo,...

@yanggeng1995 I need some help! For multiband-lpcnet, the features of each band should be extracted separately, right? If I use the original features in lpcnet, I should first get subband wavs (4 bands),...

@yanggeng1995 I tried it the way you said: all four subbands share the same LPC of the fullband. But in the data dump stage, I mean "write_audio" in...