elasticsearch-minhash
elasticsearch-minhash copied to clipboard
Elasticsearch plugin for b-bit minhash algorism
It seems that since 5.0, Elasticsearch has included a built-in [MinHash Token Filter](https://www.elastic.co/guide/en/elasticsearch/reference/5.0/analysis-minhash-tokenfilter.html) What are the differences to this plugin (if any) ? What reason would we have to use...
Hello, i try to execute first command from guide: ``` { "settings":{ "analysis":{ "analyzer":{ "minhash_analyzer":{ "type":"custom", "tokenizer":"standard", "filter":["minhash"] } } } } } ``` I replace `index` key to `settings`...
I was using minhash plugin with kibana, but I couldnt retrieve the bits keyword field in GET... PUT /test_minhash_test { "index":{ "analysis":{ "analyzer":{ "minhash_analyzer":{ "type":"custom", "tokenizer":"standard", "filter":["minhash"] } } }...
Hello, I tried to perform a search query on the stored minhash_field... e.g with the fuzzy search or with the more_like_this query .... but I get an error it cannot...
Let's assume I got a set of news articles in my ES store. Is there a way to use MinHash score value to check a new article if it fits...
Hello, after installing minhash filter in elasticsearch, I started kibana app and in the Dev tool, I copy&pasted the code to create the minhash filter mapping: curl -XPUT 'localhost:9200/my_index' -d...
Is there any chance I can perform search query against minhash field to find similar documents? I create minhash analyzer, add mapping to store minhash with the following code: ```...
Hello, When I try to install elasticsearch-minhash I get the Error: "This plugin was built with an older plugin structure. Contact the plugin author to remove the intermediate "elasticsearch" directory...
this the error I'm getting when I'm trying to use this plugin: ``` { "error": { "root_cause": [ { "type": "generation_exception", "reason": "failed to serialize source for type [message]" }...
what does the number of bits mean in this algorithm? can you please provide a link to the article discussed this specific implementation of minhash algorithm?