CosyVoice

Input format for natural-language control

Open vedbobo opened this issue 9 months ago • 0 comments

The official format is:

    for i, j in enumerate(cosyvoice.inference_instruct('在面对挑战时,他展现了非凡的<strong>勇气</strong>与<strong>智慧</strong>。', '中文男', 'Theo \'Crimson\', is a fiery, passionate rebel leader. Fights with fervor for justice, but struggles with impulsiveness.', stream=False)):
        torchaudio.save('instruct_{}.wav'.format(i), j['tts_speech'], cosyvoice.sample_rate)

But my data is in this format:

    <|zh|><|HAPPY|><|Speech|><|withitn|>在他讲述那个荒诞故事的过程中,他突然停下来 ,因为他自己也被逗笑了。

I can't feed this in directly for synthesis, because the tags get read aloud as well. Do I have to follow the demo's format strictly?
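For reference, here is a minimal sketch of one way such lines could be preprocessed before synthesis: separate the `<|...|>` tags from the spoken text, then turn an emotion tag into a natural-language description. The helper names `split_tags` and `to_instruct` and the `EMOTION_PROMPTS` mapping are hypothetical and not part of CosyVoice. Whether the model actually follows a prompt written this way is an assumption that has not been verified.

```python
import re

# Matches tags of the form <|...|>, e.g. <|zh|> or <|HAPPY|>.
# HTML-like markers such as <strong> are intentionally left alone.
TAG_RE = re.compile(r"<\|([^|]*)\|>")

# Hypothetical mapping from emotion tags to instruct prompts (an assumption;
# adjust the wording to whatever works for your data).
EMOTION_PROMPTS = {
    "HAPPY": "A cheerful speaker who talks in a happy, amused tone.",
}


def split_tags(line):
    """Return (list of tag names, text with all <|...|> tags removed)."""
    tags = TAG_RE.findall(line)
    text = TAG_RE.sub("", line).strip()
    return tags, text


def to_instruct(line, default_prompt=""):
    """Return (clean text, instruct prompt) for one tagged line."""
    tags, text = split_tags(line)
    # Use the first tag that has a known prompt, else fall back to the default.
    prompt = next(
        (EMOTION_PROMPTS[t] for t in tags if t in EMOTION_PROMPTS),
        default_prompt,
    )
    return text, prompt
```

The returned text and prompt could then be passed as the first and third arguments of `cosyvoice.inference_instruct`, in the same positions as in the demo call, so that the tags themselves are never sent to the synthesizer.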

vedbobo, May 29 '25 10:05