MDEV-10267 Add "ngram" support to MariaDB

Open i-rinat opened this issue 5 years ago • 11 comments

Ngram is a fulltext parser plugin that splits words into overlapping segments of a fixed length. For example, if 3-grams are in use, the string "abcdef" is split into "abc", "bcd", "cde", "def". This allows an efficient substring search for CJK texts, where splitting by words is not feasible. Only word characters go into n-grams; spaces and punctuation characters are treated as separators.
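
To make the splitting rule concrete, here is a rough Python sketch of the behaviour described above. It is only an illustration, not the plugin's implementation: the function name `ngram_tokens` is made up, "word character" is approximated by the regex class `\w` (which also matches digits and underscores), and dropping runs shorter than n is an assumption.

```python
import re


def ngram_tokens(text: str, n: int = 3) -> list[str]:
    """Split text into overlapping n-grams (hypothetical helper).

    Non-word characters (spaces, punctuation) act as separators,
    so no n-gram ever spans two separate runs of word characters.
    """
    tokens = []
    # \w+ finds maximal runs of word characters; for str patterns this
    # is Unicode-aware, so CJK characters count as word characters.
    for run in re.findall(r"\w+", text):
        # Assumption: a run shorter than n produces no tokens at all.
        for i in range(len(run) - n + 1):
            tokens.append(run[i:i + n])
    return tokens


print(ngram_tokens("abcdef"))            # ['abc', 'bcd', 'cde', 'def']
print(ngram_tokens("ab cd, efgh", n=2))  # ['ab', 'cd', 'ef', 'fg', 'gh']
```

A substring query would be tokenized the same way and matched against the indexed n-grams, which is what makes the search work without word boundaries.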

i-rinat, Jul 15 '20 00:07