
How to avoid instruct_text being read aloud?

Open sunshineo opened this issue 8 months ago • 7 comments

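When instruct_text is fairly long, it sometimes gets read aloud in the generated audio. After several attempts I found that it does not happen every time, which makes it very hard to control. My current workaround is to keep instruct_text as short as possible, but I don't know exactly how short it has to be to avoid this completely. https://github.com/FunAudioLLM/CosyVoice/issues/1120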
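Since the leak is intermittent, one possible stopgap is to detect it and retry. The sketch below is only an illustration under stated assumptions: `synthesize` and `transcribe` are hypothetical callables that you would supply yourself (for example a wrapper around your CosyVoice call and any ASR model), and neither is part of CosyVoice. The check is a crude substring heuristic on the transcript, so it can misfire when the spoken text legitimately shares words with the instruction.

```python
import re
from typing import Callable, Optional, TypeVar

Audio = TypeVar("Audio")


def _normalize(text: str) -> str:
    """Lowercase and strip punctuation, whitespace, and underscores (CJK is kept)."""
    return re.sub(r"[\W_]+", "", text.lower())


def instruction_leaked(transcript: str, instruct_text: str, window: int = 6) -> bool:
    """Heuristic: True if any `window`-character run of the instruction
    appears in the transcript after normalization."""
    spoken = _normalize(transcript)
    instr = _normalize(instruct_text)
    if not instr:
        return False
    n = min(window, len(instr))
    return any(instr[i:i + n] in spoken for i in range(len(instr) - n + 1))


def synthesize_without_leak(
    synthesize: Callable[[], Audio],      # hypothetical: runs one TTS attempt
    transcribe: Callable[[Audio], str],   # hypothetical: ASR over that audio
    instruct_text: str,
    max_attempts: int = 3,
) -> Optional[Audio]:
    """Retry synthesis until the transcript no longer contains the instruction.

    Returns the first clean result, or None if every attempt leaked.
    """
    for _ in range(max_attempts):
        audio = synthesize()
        if not instruction_leaked(transcribe(audio), instruct_text):
            return audio
    return None
```

This does not explain why the instruction is spoken; it only filters out bad generations, at the cost of extra synthesis and ASR passes.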

sunshineo avatar May 06 '25 07:05 sunshineo

Note that I am not talking about the handful of fixed fine-grained control tags.

sunshineo avatar May 06 '25 07:05 sunshineo

Hi, which fixed emotion instructions are there, and do you know where I can look them up?

sunzhongbob avatar May 08 '25 02:05 sunzhongbob

The fixed fine-grained controls are in cosyvoice/tokenizer/tokenizer.py:

special_tokens = {
    'eos_token': '<|endoftext|>',
    'pad_token': '<|endoftext|>',
    'additional_special_tokens': [
        '<|im_start|>', '<|im_end|>', '<|endofprompt|>',
        '[breath]', '<strong>', '</strong>', '[noise]',
        '[laughter]', '[cough]', '[clucking]', '[accent]',
        '[quick_breath]',
        "<laughter>", "</laughter>",
        "[hissing]", "[sigh]", "[vocalized-noise]",
        "[lipsmack]", "[mn]"
    ]
}
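Because only the tags in that list are registered as special tokens, a quick pre-flight check on the input text can catch misspelled or made-up tags before synthesis. The following is a minimal sketch, not part of CosyVoice: `KNOWN_TAGS`, `TAG_PATTERN`, and `unknown_tags` are hypothetical names, the tag set is copied from the snippet quoted above, and matching is assumed to be case-sensitive.

```python
import re
from typing import List

# Inline control tags copied from the special_tokens list quoted above.
# Chat-template markers such as <|im_start|> are deliberately left out.
KNOWN_TAGS = {
    "[breath]", "<strong>", "</strong>", "[noise]",
    "[laughter]", "[cough]", "[clucking]", "[accent]",
    "[quick_breath]", "<laughter>", "</laughter>",
    "[hissing]", "[sigh]", "[vocalized-noise]",
    "[lipsmack]", "[mn]",
}

# Anything that looks like a tag: [word], <word>, or </word>.
TAG_PATTERN = re.compile(r"\[[^\[\]\s]+\]|</?[A-Za-z][^<>\s]*>")


def unknown_tags(text: str) -> List[str]:
    """Return tag-like substrings that are not in KNOWN_TAGS, in order of appearance."""
    return [tag for tag in TAG_PATTERN.findall(text) if tag not in KNOWN_TAGS]


if __name__ == "__main__":
    sample = "Hello [breath] this is <strong>great</strong> [giggle]"
    print(unknown_tags(sample))
```

An empty result only means every tag-like substring is in the quoted list; it says nothing about how free-form instruction text is handled.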

sunshineo avatar May 08 '25 04:05 sunshineo

Thanks.

sunzhongbob avatar May 08 '25 05:05 sunzhongbob

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] avatar Jun 08 '25 02:06 github-actions[bot]

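Where can I find out which prefix words were used during training? Which words related to "dialect", "emotion", "speed", and "role" appear in the training data? I would expect much better results if generation used only those words.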

sunshineo avatar Jul 02 '25 16:07 sunshineo

Mark! With the instruction-synthesis pretrained model, the generated audio contains the instruction text.

JohnHerry avatar Nov 03 '25 08:11 JohnHerry