THULAC-Python

Why does this error still occur when the txt file is encoded as UTF-8?

Open PhilrainV opened this issue 5 years ago • 2 comments

UnicodeDecodeError: 'gbk' codec can't decode byte 0xa8 in position 0: incomplete multibyte sequence

PhilrainV, Feb 18 '20 13:02

Does this happen when you are processing a file? What does your full code look like?

kathy98443, Apr 07 '20 04:04

I am running into the same problem. My code is as follows:

import thulac
import codecs

thu1 = thulac.thulac()
thu1.cut_f("input.txt", "output.txt")
print('end')

fanrongqitiancai, Feb 05 '22 13:02
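
The 'gbk' in the error message suggests the file is being decoded with the system locale's default encoding (gbk on a Chinese-language Windows install) rather than UTF-8, which is what Python's open() does when no encoding argument is given. A possible workaround, assuming cut_f opens its files without an explicit encoding (this thread does not confirm that), is to read and write the files yourself and only hand the text to the segmenter. The sketch below is not part of THULAC; cut_utf8_file and its segment parameter are hypothetical names used for illustration.

```python
from typing import Callable


def cut_utf8_file(segment: Callable[[str], str],
                  input_path: str,
                  output_path: str) -> None:
    """Segment a UTF-8 text file line by line.

    Hypothetical helper, not part of THULAC. `segment` is any callable
    that takes one line of text and returns the segmented line.
    """
    # An explicit encoding stops Python from falling back to the
    # locale default (for example gbk), which is what raises
    # UnicodeDecodeError on UTF-8 input.
    with open(input_path, "r", encoding="utf-8") as src, \
         open(output_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write(segment(line.rstrip("\n")) + "\n")
```

With THULAC this could be called as cut_utf8_file(lambda s: thu1.cut(s, text=True), "input.txt", "output.txt"), where thu1 = thulac.thulac() as in the code above. If the file was saved with a byte order mark, encoding="utf-8-sig" on the input side may be the better choice.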