Do the synthesize inputs must be the .npy file

Open wpy0521 opened this issue 7 years ago • 2 comments

as i known, Clarinet is a end-to-end model(Text-to-Speech). But this model allows only the .npy file as the inputs. Can anyone use a sentence to synthesize ? what's more , i wonder the function of the Clarinet. can it realize the multi-speakers synthetise? or just make synthesize results better?

Jan 04 '19 02:01 wpy0521

same question. The LJspeechDataset and DataLoader only load the .npy data as inputs, not use the sentence text.

Jan 11 '19 10:01 BABALA258

same question here as well. Is there a way that I could use text as input?

Feb 12 '19 22:02 mingboma