elasticsearch-entity-resolution
elasticsearch-entity-resolution copied to clipboard
Elasticsearch entity resolution plugin based on Duke
Just curious if it is likely to happen or not.
Hello! Is there a method to use this plugin on nested field? Mapping is: ``` { "components": { "mappings": { "component": { "properties": { ... OTHER FIELDS ... "values": {...
I'm having an issue with this plugin. It seems as if no matter what I try to do all of my hits only get rescored by the low. My query...
Hi Yann, I need to use WeightedLevenshtein which is available in Duke. In Duke I set the parameters like this: ``` ``` then create the comparator named wl like this:...
Suppose we've generated, through genetic Duke algorithm or by hand, an xml configuration file (maybe also duke's output xml), which contains, for each field of a given entity, thresholds (high,...
Hi Yann, According to the Duke documentation Duke provides a Record Linkage mode (I.e., linking records from two distinct datasets), as well as a Deduping mode (I.e., identifying duplicates within...
[Tests implemented](https://github.com/YannBrrd/elasticsearch-entity-resolution/blob/master/src/test/java/org/yaba/entity/script/JaccardIndexComparatorTests.java) show that JaccardIndex does not provide stable scores. Seems Duke related. @larsga do you have any idea about where it could come from ?
Hi Yann, I run into a problem with compareProb. I think that v1 is null at this point: ``` for (String v1 : vs1) { if (v1.equals("")) { continue; }...