geowave

Benchmark using CountMinSketch on character n-grams for text indexing

Open rfecher opened this issue 3 years ago • 0 comments

It should be an improvement to use CountMinSketch as an index statistic for text indices. Then, for terms that are longer than the "n" of the n-gram, we can choose the n-gram within the term that has the smallest estimated cardinality.
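To make the idea concrete, here is a minimal sketch of it, not GeoWave's actual statistics API. The class `NGramCountMin`, its methods, and the seeded FNV-1a-style hash are all hypothetical names and choices made for illustration, and "estimated cardinality" is read here as the estimated occurrence count of an n-gram across indexed terms.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: a tiny count-min sketch over character n-grams.
final class NGramCountMin {
  private final int depth;      // number of hash rows
  private final int width;      // counters per row
  private final long[][] table;

  NGramCountMin(int depth, int width) {
    this.depth = depth;
    this.width = width;
    this.table = new long[depth][width];
  }

  // Seeded FNV-1a-style hash (an assumption; any per-row hash family would do).
  private int bucket(String item, int row) {
    int h = 0x811C9DC5 ^ (row * 0x9E3779B9);
    for (byte b : item.getBytes(StandardCharsets.UTF_8)) {
      h ^= (b & 0xff);
      h *= 16777619;
    }
    h ^= h >>> 16;
    return Math.floorMod(h, width);
  }

  void add(String item) {
    for (int row = 0; row < depth; row++) {
      table[row][bucket(item, row)]++;
    }
  }

  // Count-min estimate: the minimum counter over all rows. It never undercounts.
  long estimate(String item) {
    long min = Long.MAX_VALUE;
    for (int row = 0; row < depth; row++) {
      min = Math.min(min, table[row][bucket(item, row)]);
    }
    return min;
  }

  // All character n-grams of a term, in order; empty if the term is shorter than n.
  static List<String> ngrams(String term, int n) {
    List<String> out = new ArrayList<>();
    for (int i = 0; i + n <= term.length(); i++) {
      out.add(term.substring(i, i + n));
    }
    return out;
  }

  // Record every n-gram of an indexed term in the sketch.
  void indexTerm(String term, int n) {
    for (String gram : ngrams(term, n)) {
      add(gram);
    }
  }

  // For a query term of length >= n, pick the n-gram with the smallest estimate.
  String rarestNGram(String term, int n) {
    List<String> grams = ngrams(term, n);
    if (grams.isEmpty()) {
      throw new IllegalArgumentException("term is shorter than n");
    }
    String best = grams.get(0);
    for (String gram : grams) {
      if (estimate(gram) < estimate(best)) {
        best = gram;
      }
    }
    return best;
  }
}
```

For example, after indexing "geowave", "geoserver", "geometry", and "wavelet" with n = 3, a query for "geowave" would be expected to favor an n-gram such as "eow" over "geo", since "geo" occurs in three of the indexed terms. Because the sketch can only overestimate, hash collisions may occasionally make a rare n-gram look more common than it is, which is one of the things the proposed benchmark would need to measure.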

rfecher, Dec 17 '20 20:12