elasticsearch-minhash icon indicating copy to clipboard operation
elasticsearch-minhash copied to clipboard

Elasticsearch plugin for b-bit minhash algorism

Results 14 elasticsearch-minhash issues
Sort by recently updated
recently updated
newest added

It seems that since 5.0, Elasticsearch has included a built-in [MinHash Token Filter](https://www.elastic.co/guide/en/elasticsearch/reference/5.0/analysis-minhash-tokenfilter.html) What are the differences to this plugin (if any) ? What reason would we have to use...

Hello, i try to execute first command from guide: ``` { "settings":{ "analysis":{ "analyzer":{ "minhash_analyzer":{ "type":"custom", "tokenizer":"standard", "filter":["minhash"] } } } } } ``` I replace `index` key to `settings`...

question

I was using minhash plugin with kibana, but I couldnt retrieve the bits keyword field in GET... PUT /test_minhash_test { "index":{ "analysis":{ "analyzer":{ "minhash_analyzer":{ "type":"custom", "tokenizer":"standard", "filter":["minhash"] } } }...

Hello, I tried to perform a search query on the stored minhash_field... e.g with the fuzzy search or with the more_like_this query .... but I get an error it cannot...

question

Let's assume I got a set of news articles in my ES store. Is there a way to use MinHash score value to check a new article if it fits...

Hello, after installing minhash filter in elasticsearch, I started kibana app and in the Dev tool, I copy&pasted the code to create the minhash filter mapping: curl -XPUT 'localhost:9200/my_index' -d...

question

Is there any chance I can perform search query against minhash field to find similar documents? I create minhash analyzer, add mapping to store minhash with the following code: ```...

question

Hello, When I try to install elasticsearch-minhash I get the Error: "This plugin was built with an older plugin structure. Contact the plugin author to remove the intermediate "elasticsearch" directory...

question

this the error I'm getting when I'm trying to use this plugin: ``` { "error": { "root_cause": [ { "type": "generation_exception", "reason": "failed to serialize source for type [message]" }...

question

what does the number of bits mean in this algorithm? can you please provide a link to the article discussed this specific implementation of minhash algorithm?

question