manticoresearch icon indicating copy to clipboard operation
manticoresearch copied to clipboard

custom Chinese dictionary

Open lgl5240 opened this issue 1 year ago • 3 comments

Hello, I'd like to ask if you support the use of a custom Chinese dictionary for segmentation?

lgl5240 avatar Oct 03 '23 16:10 lgl5240

It should be possible. We discussed it in this issue https://github.com/manticoresoftware/manticoresearch/issues/371#issuecomment-654596874

If you manage to make it work, pls let me know, we'd like to add it to the docs, make an article about it etc.

sanikolaev avatar Oct 06 '23 10:10 sanikolaev

这应该是可能的。我们在本期#371(评论)中对此进行了讨论

如果您设法使其工作,请告诉我,我们希望将其添加到文档中,撰写一篇有关它的文章等。

It should be possible. We discussed it in this issue #371 (comment)

If you manage to make it work, pls let me know, we'd like to add it to the docs, make an article about it etc.

thank you

lgl5240 avatar Oct 06 '23 14:10 lgl5240

@sanikolaev

https://www.amazonaws.cn/en/new/2022/amazon-opensearch-custom-dictionaries-ik-analysis-plugin/ https://github.com/soosinha/opensearch-analysis-ik

Here's a more elegant implementation, referring to the IK plugin for opensearch(same to es), that supports a dynamic Api interface to the dictionary, 'remote_ext_dict'

forcemeter avatar Oct 13 '23 11:10 forcemeter