jieba.NET
posSeg.Cut("北京市海淀区", true) returns incorrect results when a custom dictionary is used
// Assumes posSeg is a JiebaNet.Segmenter.PosSeg.PosSegmenter instance created earlier.
// Remove the existing entries first.
JiebaNet.Segmenter.WordDictionary.Instance.DeleteWord("北京");
JiebaNet.Segmenter.WordDictionary.Instance.DeleteWord("北京市");
JiebaNet.Segmenter.WordDictionary.Instance.DeleteWord("海淀区");
// Re-add the words with custom part-of-speech tags.
JiebaNet.Segmenter.WordDictionary.Instance.AddWord("北京", 6000, "city");
JiebaNet.Segmenter.WordDictionary.Instance.AddWord("北京市", 6000, "city");
JiebaNet.Segmenter.WordDictionary.Instance.AddWord("海淀区", 6000, "town");
//seg.LoadUserDict(AppDomain.CurrentDomain.SetupInformation.ApplicationBase + "userdict.txt");
// Check that the words are present in the dictionary.
JiebaNet.Segmenter.WordDictionary.Instance.ContainsWord("北京");
JiebaNet.Segmenter.WordDictionary.Instance.ContainsWord("北京市");
// Segment with POS tagging, HMM enabled.
var tokens = posSeg.Cut("北京市海淀区", true);
By rights, the result of Cut should be: 北京市 city 海淀区 town. The actual result is: 北京市 ns 海淀区 ns.
Hmm, this does look like a problem. I'll take a look at the source code later. Thanks for the issue.
On 13 August 2018 at 14:51, gzcqy wrote the original post quoted above (https://github.com/anderscui/jieba.NET/issues/52).
Thanks,
Anders Cui