THULAC-Python

Segmentation raises an error when the "//" symbol appears

Open huntercmd opened this issue 8 years ago • 3 comments

Hello, when "//" (a symbol common in URLs) appears in the input, segmentation raises an error, although the web demo segments the same text without problems.

The error is as follows:

    content = "我们很//开心"
    text = thu1.cut(content, text=True)  # segment a single sentence

      File "C:\Anaconda2\lib\site-packages\thulac\__init__.py", line 78, in cut
        txt += reduce(lambda x, y: x + ' ' + y, self.cutline(line)) + '\n'
      File "C:\Anaconda2\lib\site-packages\thulac\__init__.py", line 133, in cutline
        self.punctuation.adjustTag(tagged)
      File "C:\Anaconda2\lib\site-packages\thulac\manage\Punctuation.py", line 39, in adjustTag
        tmp = sentence[i][0]
    IndexError: list index out of range
    [Finished in 4.9s with exit code 1]

huntercmd, Feb 13 '17 07:02

Thank you for supporting THULAC. The problem you reported has been fixed. I forgot to reply promptly after the update, sorry about that.

MaJunhua, Feb 28 '17 14:02

When will the pip version be updated?

huntercmd, Mar 01 '17 01:03

Hello, the pip version has been updated; please upgrade and try again.

Best wishes!


gzp9595, Mar 01 '17 02:03
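
For anyone who cannot upgrade yet, the following is a minimal workaround sketch, not the fix that was applied upstream. It assumes the crash is triggered only by runs of two or more "/" characters, and that segmenting the text on either side of such a run separately is acceptable. The helper name `cut_around_double_slash` and its `cut` parameter are hypothetical; `cut` stands in for any callable that takes a string and returns a space-joined segmentation.

```python
import re


def cut_around_double_slash(cut, content):
    """Segment `content` with `cut`, never passing runs of '/' to it.

    `cut` is a hypothetical stand-in for a segmenter call that returns
    space-joined text, e.g. `lambda s: thu1.cut(s, text=True)`.
    """
    # The capturing group makes re.split keep the slash runs, so the result
    # alternates: text, slashes, text, slashes, ..., text.
    pieces = re.split(r"(/{2,})", content)
    tokens = []
    for index, piece in enumerate(pieces):
        if not piece:
            continue  # empty strings appear when content starts or ends with slashes
        if index % 2 == 1:
            tokens.append(piece)  # a slash run: keep it as its own token
        else:
            segmented = cut(piece).strip()  # strip a possible trailing newline
            if segmented:
                tokens.append(segmented)
    return " ".join(tokens)
```

With the segmenter object from the report above, this would be called as `cut_around_double_slash(lambda s: thu1.cut(s, text=True), content)`. Because the pieces are segmented independently, word boundaries next to the slashes may differ from what the fixed release produces, so upgrading the pip package remains the better option.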