elasticsearch-rtf

Synonym feature does not work in this distribution

Open Steven-Z-Yang opened this issue 7 years ago • 2 comments

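I followed the official guide to try out the synonym feature. It does not work in the rtf release, while the regular English distribution has no problem at all. Steps followed: https://www.elastic.co/guide/en/elasticsearch/guide/current/using-synonyms.html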

Steven-Z-Yang avatar Oct 19 '16 09:10 Steven-Z-Yang

Could you please post your test steps?

medcl avatar Jan 05 '17 08:01 medcl

I simply followed the steps given in the official guide.

Step 1:

    PUT /my_index
    {
      "settings": {
        "analysis": {
          "filter": {
            "my_synonym_filter": {
              "type": "synonym",
              "synonyms": [ "british,english", "queen,monarch" ]
            }
          },
          "analyzer": {
            "my_synonyms": {
              "tokenizer": "standard",
              "filter": [ "lowercase", "my_synonym_filter" ]
            }
          }
        }
      }
    }

Step 2:

    GET /my_index/_analyze
    {
      "analyzer": "my_synonyms",
      "text": "Elizabeth is the English queen"
    }

The result shown in the official guide is:

    Pos 1: (elizabeth)
    Pos 2: (is)
    Pos 3: (the)
    Pos 4: (british,english)
    Pos 5: (queen,monarch)

The analysis result from rtf, however, is:

    {
      "tokens": [
        { "token": "elizabeth", "start_offset": 0, "end_offset": 9, "type": "<ALPHANUM>", "position": 0 },
        { "token": "is", "start_offset": 10, "end_offset": 12, "type": "<ALPHANUM>", "position": 1 },
        { "token": "the", "start_offset": 13, "end_offset": 16, "type": "<ALPHANUM>", "position": 2 },
        { "token": "english", "start_offset": 17, "end_offset": 24, "type": "<ALPHANUM>", "position": 3 },
        { "token": "queen", "start_offset": 25, "end_offset": 30, "type": "<ALPHANUM>", "position": 4 }
      ]
    }

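Clearly the token english did not get its synonym british added at the same position, and the same goes for queen, which is missing monarch.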
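
For anyone who wants to check an `_analyze` response without reading the JSON by eye, here is a minimal Python sketch. It is only an illustration, not part of Elasticsearch or rtf: the helper names `build_groups` and `missing_synonyms` are made up for this example, and it assumes simple comma-separated equivalence rules like the two used above (it does not handle `=>` rules). It groups tokens by position and reports any synonym that should share a position but is absent.

```python
# Synonym rules copied from the my_synonym_filter settings above.
SYNONYM_RULES = ["british,english", "queen,monarch"]


def build_groups(rules):
    """Map each term to the full set of terms in its comma-separated rule."""
    groups = {}
    for rule in rules:
        terms = {t.strip() for t in rule.split(",")}
        for term in terms:
            groups[term] = terms
    return groups


def missing_synonyms(analyze_response, rules):
    """Return {position: sorted list of synonyms missing at that position}."""
    groups = build_groups(rules)

    # Collect every token emitted at each position.
    by_position = {}
    for tok in analyze_response["tokens"]:
        by_position.setdefault(tok["position"], set()).add(tok["token"])

    missing = {}
    for pos, tokens in sorted(by_position.items()):
        expected = set()
        for t in tokens:
            expected |= groups.get(t, set())
        gap = expected - tokens
        if gap:
            missing[pos] = sorted(gap)
    return missing


# Trimmed copy of the rtf output above (offsets and types omitted).
rtf_response = {
    "tokens": [
        {"token": "elizabeth", "position": 0},
        {"token": "is", "position": 1},
        {"token": "the", "position": 2},
        {"token": "english", "position": 3},
        {"token": "queen", "position": 4},
    ]
}

print(missing_synonyms(rtf_response, SYNONYM_RULES))
# prints {3: ['british'], 4: ['monarch']}
```

An empty dict would mean every synonym group is fully present at its position, which is what the official guide's output corresponds to.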

Steven-Z-Yang avatar Jan 06 '17 08:01 Steven-Z-Yang