WordSimilarity icon indicating copy to clipboard operation
WordSimilarity copied to clipboard

词库里面没有的词语,比较结果是0

Open soulspirit1229 opened this issue 6 years ago • 4 comments

soulspirit1229 avatar Jan 23 '18 15:01 soulspirit1229

这是默认设定(或许本来也没有哪两个词对比会是0,最小也是0.000000*****)。 当然追求完美的你也可以改为负数,表示这其实是一个异常。 我的项目Final_word_Similarity 您也可以了解一下。

yaleimeng avatar Apr 18 '18 11:04 yaleimeng

我将整个文件解压到python目录下 :\Users\15311\AppData\Local\Programs\Python\Python36\python.exe C:/Users/15311/Desktop/adsfs/test/test_word_similarity.py Traceback (most recent call last): File "C:/Users/15311/Desktop/adsfs/test/test_word_similarity.py", line 38, in a.test_similarity_2010() File "C:/Users/15311/Desktop/adsfs/test/test_word_similarity.py", line 12, in test_similarity_2010 ws_tool = WordSimilarity2010() File "C:\Users\15311\AppData\Roaming\Python\Python36\site-packages\word_similarity_init_.py", line 101, in init super(WordSimilarity2010, self).init() File "C:\Users\15311\AppData\Roaming\Python\Python36\site-packages\word_similarity_init_.py", line 20, in init self.load_cilin(t_cilin_path) File "C:\Users\15311\AppData\Roaming\Python\Python36\site-packages\word_similarity_init.py", line 45, in _load_cilin line = file_obj.readline() UnicodeDecodeError: 'gbk' codec can't decode byte 0xba in position 11: illegal multibyte sequence

却报错,大神这是为什么呢 能出一份详细一点的使用教程吗

zhaochongzc avatar May 10 '18 14:05 zhaochongzc

这个项目算法已是8年前的了。我的项目Final_word_Similarity 按最近2年的新算法迭代了几次,计算效果遥遥领先,不了解一下吗? @zhaochongzc @soulspirit1229

yaleimeng avatar May 11 '18 01:05 yaleimeng

那肯定要拜读一下啦,谢谢你啦

zhaochongzc avatar May 11 '18 06:05 zhaochongzc