deduplicate-elasticsearch icon indicating copy to clipboard operation
deduplicate-elasticsearch copied to clipboard

Remove duplicate documents from Elasticsearch

Results 4 deduplicate-elasticsearch issues
Sort by recently updated
recently updated
newest added

Line in the script: matching_docs = es.mget(index="stocks", doc_type="doc", body={"ids": array_of_ids}) should change to: matching_docs = es.mget(index="stocks", doc_type="_doc", body={"ids": array_of_ids}) doc_type shoud be _doc instead of doc, otherwise the matching_docs return...

Enable and implement secrets file, allowing credentials to securely be stored and access preventing plaintext passwords from being present in working script. Enable index as variable so we only have...

Just thought it would be a good addition to make the index into a var. Saves n00bs like me of missing the second place where to change the index. :)