Liujingxiu23 comments

Results: 46 comments of Liujingxiu23

ImportError: No module named rtmidi

Yes, the problem still exists, and when I run "pip install python-rtmidi", it cannot be installed successfully.
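A quick way to tell "the package never installed" apart from "it installed into a different interpreter" is to ask the running Python whether it can find the module at all. The sketch below is only an illustration: the helper name module_available is hypothetical, and it assumes the python-rtmidi package exposes a top-level module named rtmidi, as the ImportError above suggests.

```python
import importlib.util
import sys


def module_available(name: str) -> bool:
    """Return True if the current interpreter can locate a top-level module `name`."""
    # find_spec returns None when no such top-level module is on sys.path.
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # Show which interpreter is being checked; pip may target a different one.
    print("interpreter:", sys.executable)
    if module_available("rtmidi"):
        print("rtmidi is importable")
    else:
        # Assumption: installing via "python -m pip" targets this same interpreter.
        print("rtmidi not found; try:", sys.executable, "-m pip install python-rtmidi")
```

If the module is reported as missing and the pip command itself fails, the build log printed by pip is the next thing to look at.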

Support for other languages

How about training the encoder (speaker verification) on English multi-speaker datasets but training the synthesizer on a Chinese dataset, supposing there is enough data for each model separately?

Support for other languages

@CorentinJ Thank you for your reply. Maybe I should find some Chinese ASR datasets to train the speaker verification model.

Support for other languages

@magneter I have not trained the Chinese model. I don't have enough data to train the speaker verification model, so I am trying to collect suitable data now.

regex_error?

Using gcc 4.9 may solve the problem.

not able to reproduce the results in original paper

@malradhi How are your converted wavs? My generated wavs sound just like noise.

How to combine melGAN with feature predictor like FastSpeech or tacotron2?

Have you tried FastSpeech combined with MelGAN? How is the result?

How can I synthesize my own text to speech?

@Wenqikry Have you tried https://github.com/Rayhane-mamah/Tacotron-2 or https://github.com/NVIDIA/tacotron2 to train on log-mels and then combine it with MelGAN? Do you have any experience with this? I used https://github.com/Rayhane-mamah/Tacotron-2 , changed the features to log-mel as in this repo,...

Merge multi-band linear prediction into LPCNet

@yanggeng1995 Asking for help! For multiband LPCNet, the features of each band should be extracted separately, right? If I use the original features in LPCNet, I should first get the subband wavs (4 bands),...

Merge multi-band linear prediction into LPCNet

@yanggeng1995 I tried it the way you said: all four subbands share the same LPC of the fullband. But in the data dump stage, I mean "write_audio" in...