elasticsearch-entity-resolution
elasticsearch-entity-resolution copied to clipboard
Duke Record Linkage Mode
Hi Yann,
According to the Duke documentation Duke provides a Record Linkage mode (I.e., linking records from two distinct datasets), as well as a Deduping mode (I.e., identifying duplicates within one dataset.)
In Duke, in order to utilize the Record Linkage mode, one would configure a mapping file to align fields from two datasets.
Does the plugin expose this functionality? I couldn’t find the a reference to such feature. Hopefully I just missed it :)
Thanks in advance!
Hi Ofer,
No you did not miss it. I did not implement that. I might have a try one of these days.
Yann
I would also love to see this feature. Possible to sponsor it?
Hi,
What do you mean by "sponsor" ?
I'm not sure we should implement this within the plugin. I think at first glance that this is more linked to Duke usage in batch mode with Elasticsearch with a source...