CosyVoice 如何避免instruct

当instruct_text比较长的时候，有时候就会被读出来，多试几次发现也不是每次都读出来，非常难掌握现在我的解决办法是尽量让instruct_text短一些，但也不知道具体要多短才能完全避免被读出来 https://github.com/FunAudioLLM/CosyVoice/issues/1120

May 06 '25 07:05 sunshineo

请注意我说的不是那几个固定的 fine grained control

May 06 '25 07:05 sunshineo

你好，请问固定的那几个情感指令有哪些，在哪里看知道吗

May 08 '25 02:05 sunzhongbob

固定的fine grained control在cosyvoice/tokenizer/tokenizer.py

special_tokens = {
            'eos_token': '<|endoftext|>',
            'pad_token': '<|endoftext|>',
            'additional_special_tokens': [
                '<|im_start|>', '<|im_end|>', '<|endofprompt|>',
                '[breath]', '<strong>', '</strong>', '[noise]',
                '[laughter]', '[cough]', '[clucking]', '[accent]',
                '[quick_breath]',
                "<laughter>", "</laughter>",
                "[hissing]", "[sigh]", "[vocalized-noise]",
                "[lipsmack]", "[mn]"
            ]
        }

May 08 '25 04:05 sunshineo

固定的细粒度控制cosyvoice/tokenizer/tokenizer.py

special_tokens = {
            'eos_token': '<|endoftext|>',
            'pad_token': '<|endoftext|>',
            'additional_special_tokens': [
                '<|im_start|>', '<|im_end|>', '<|endofprompt|>',
                '[breath]', '<strong>', '</strong>', '[noise]',
                '[laughter]', '[cough]', '[clucking]', '[accent]',
                '[quick_breath]',
                "<laughter>", "</laughter>",
                "[hissing]", "[sigh]", "[vocalized-noise]",
                "[lipsmack]", "[mn]"
            ]
        }

谢谢

May 08 '25 05:05 sunzhongbob

This issue is stale because it has been open for 30 days with no activity.

Jun 08 '25 02:06 github-actions[bot]

请问哪里能获取训练时用到了哪些prefix单词？训练数据有哪些“方言”，“情感”，“速度”，“角色”相关的单词？我想生成时只用这些单词效果会好很多吧？

Jul 02 '25 16:07 sunshineo

mark！指令合成预训练模型，生成的内容具有有指令文本。

Nov 03 '25 08:11 JohnHerry

如何避免instruct_text被读出来？