MMMMichaelzhang

6 comments by MMMMichaelzhang

mel_tensor = (torch.log(1e-5 + mel_tensor) - self.mean) / self.std Have you fixed the noise problem? When I set mean = 0 and std = 1, the noise is gone, but the output is too loud. @Charlottecuc
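To make the effect of the two constants concrete, here is a minimal sketch of the same log-mel normalization and its inverse in plain Python. The function names and the example constants (-4.0 and 4.0) are placeholders chosen for illustration, not values taken from the repository. It only shows how mean and std rescale the features; it does not diagnose the noise.

```python
import math

def normalize_log_mel(mel_values, mean, std, eps=1e-5):
    """Apply (log(eps + x) - mean) / std to each mel value."""
    return [(math.log(eps + v) - mean) / std for v in mel_values]

def denormalize_log_mel(norm_values, mean, std, eps=1e-5):
    """Invert normalize_log_mel: x = exp(n * std + mean) - eps."""
    return [math.exp(n * std + mean) - eps for n in norm_values]

mel = [0.01, 0.5, 2.0]  # toy mel magnitudes, not real data

# Placeholder constants (assumption): some non-trivial mean/std pair.
with_stats = normalize_log_mel(mel, mean=-4.0, std=4.0)
# The setting tried in the comment: mean = 0, std = 1 (plain log-mel).
plain_log = normalize_log_mel(mel, mean=0.0, std=1.0)
```

The two outputs differ in both offset and scale, so a model that was trained with one pair of constants sees differently scaled input when it is fed features normalized with another pair.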

How can I run inference on a large WAV file? When I run inference on a 60 s, 44100 Hz audio file, it uses too much GPU memory and then stops. I set cut_length=44100*1. Do you have any idea to solve...

My GPU has 24 GB of memory, and I can run inference on a track of at most about 10 s at 44100 Hz. I split the audio into 10 s chunks and then combine them, but I get audio problems at the points where the chunks are joined. If I want...
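One common way to reduce artifacts at chunk boundaries is to make consecutive chunks overlap and crossfade the overlapping region when joining the outputs. The sketch below is a minimal illustration of that idea, not the project's own method. It assumes the model returns exactly as many samples as it receives, and `model_fn`, `split_with_overlap`, `crossfade_concat`, and `process_in_chunks` are hypothetical names. Plain lists are used to keep it self-contained; real code would more likely operate on arrays or tensors.

```python
def split_with_overlap(samples, chunk_len, overlap):
    """Cut samples into chunks of at most chunk_len; neighbours share `overlap` samples."""
    if not 0 <= overlap < chunk_len:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_len")
    hop = chunk_len - overlap
    chunks = []
    start = 0
    while True:
        chunks.append(samples[start:start + chunk_len])
        if start + chunk_len >= len(samples):
            break
        start += hop
    return chunks

def crossfade_concat(chunks, overlap):
    """Join processed chunks, linearly blending each overlapping region."""
    out = list(chunks[0])
    for chunk in chunks[1:]:
        n = min(overlap, len(out), len(chunk))
        base = len(out) - n  # index in `out` where the overlap begins
        for i in range(n):
            w = (i + 1) / (n + 1)  # weight of the new chunk, rising across the overlap
            out[base + i] = (1 - w) * out[base + i] + w * chunk[i]
        out.extend(chunk[n:])
    return out

def process_in_chunks(samples, model_fn, chunk_len, overlap):
    """Run model_fn on each chunk separately, then crossfade the results together."""
    chunks = split_with_overlap(samples, chunk_len, overlap)
    return crossfade_concat([model_fn(c) for c in chunks], overlap)
```

For 44100 Hz audio, this could be called as `process_in_chunks(audio, model_fn, chunk_len=44100 * 10, overlap=4410)`; the overlap length is a tuning choice, and a longer overlap trades extra computation for smoother joins.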

I have some questions about how to run inference in Mandarin. First, I am not sure whether this is right for Mandarin: _pad = "$" _punctuation = ';:,.!?¡¿—…"«»“” '...

> > > > I use pinyin for ASR and StyleTTS, and it generates normal, good results. Could you share some details, like how to set inference file _pad...

> > > > For Mandarin, I didn't use ipa_phonemes; I use pinyin initials and finals as phonemes. > > 1. You can use pypinyin to generate pinyin. > 2. The _initials...
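As an illustration of the initials-and-finals idea, the sketch below splits tone-numbered pinyin syllables (such as pypinyin can produce) into an initial and a final. It is an assumption-laden example, not the commenter's symbol set: the `_initials` list in the comment above is truncated, so `INITIALS` here is simply the 21 standard pinyin initials, and `split_syllable` and `to_phonemes` are hypothetical helper names.

```python
# The 21 standard Mandarin pinyin initials (assumption: not the commenter's list).
# Two-letter initials come first so "zh", "ch", "sh" match before "z", "c", "s".
INITIALS = ("zh", "ch", "sh", "b", "p", "m", "f", "d", "t", "n", "l",
            "g", "k", "h", "j", "q", "x", "r", "z", "c", "s")

def split_syllable(syllable):
    """Split a tone-numbered syllable such as 'zhong1' into (initial, final)."""
    for initial in INITIALS:
        if syllable.startswith(initial):
            return initial, syllable[len(initial):]
    return "", syllable  # no initial, e.g. 'an4' or 'er2'

def to_phonemes(syllables):
    """Flatten a list of syllables into a sequence of initial and final symbols."""
    phonemes = []
    for syllable in syllables:
        initial, final = split_syllable(syllable)
        if initial:
            phonemes.append(initial)
        phonemes.append(final)
    return phonemes
```

Here `to_phonemes(["ni3", "hao3"])` yields `["n", "i3", "h", "ao3"]`. Whether the tone stays attached to the final or becomes a separate symbol is a design choice that has to match the symbol table the model was trained with.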