
bug: Text emoticons added to the custom dictionary have no effect

Open idiomer opened this issue 3 years ago • 1 comment

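As shown below, custom emoticons that contain square brackets can be added to the dictionary, but segmentation does not keep them as single tokens: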

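import jieba

# Two bracketed emoticons and two underscore-delimited ones
biaoqing_list = ['[捂脸]', '[doge]', '___捂脸___', '___doge___']
for x in biaoqing_list:
    jieba.add_word(x, freq=10000, tag='nz')

# All four words are registered in the user dictionary...
print(jieba.user_word_tag_tab)
# ...but only the underscore variants survive segmentation as single tokens
print(jieba.lcut('[捂脸][doge]哈哈哈___捂脸___和___doge___'))

# Output:
# {'[捂脸]': 'nz', '[doge]': 'nz', '___捂脸___': 'nz', '___doge___': 'nz'}
# ['[', '捂脸', ']', '[', 'doge', ']', '哈哈哈', '___捂脸___', '和', '___doge___']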
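
A possible workaround, sketched here rather than taken from jieba itself: pull the bracketed emoticons out with a regular expression before segmentation and pass only the remaining text to the segmenter. The helper name `protect_emoticons`, the pattern `EMOTICON_RE`, and the stand-in segmenter are hypothetical; in practice `jieba.lcut` would be passed as `segment`. The sketch assumes emoticons have the form `[...]` with no nested brackets or whitespace.

```python
import re

# Hypothetical pattern: a bracketed emoticon such as [doge] or [捂脸].
# The capturing group makes re.split() keep the matched emoticons.
EMOTICON_RE = re.compile(r"(\[[^\[\]\s]+\])")


def protect_emoticons(text, segment):
    """Tokenize `text`, keeping each bracketed emoticon as one token.

    `segment` is any callable mapping a string to a list of tokens,
    for example jieba.lcut.
    """
    tokens = []
    for chunk in EMOTICON_RE.split(text):
        if not chunk:
            continue  # re.split() yields empty strings between adjacent matches
        if EMOTICON_RE.fullmatch(chunk):
            tokens.append(chunk)  # emoticon: keep whole
        else:
            tokens.extend(segment(chunk))  # ordinary text: delegate
    return tokens


# Stand-in segmenter so the sketch runs without jieba installed;
# replace with jieba.lcut for real segmentation.
def whole_chunk(s):
    return [s]


result = protect_emoticons('[捂脸][doge]哈哈哈', whole_chunk)
print(result)
```

With `segment=jieba.lcut`, the bracketed emoticons never reach jieba, so they cannot be split, while the underscore variants continue to rely on the `add_word` entries as before.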

idiomer, Jun 28 '21 03:06