elasticsearch-entity-resolution icon indicating copy to clipboard operation
elasticsearch-entity-resolution copied to clipboard

Duke Record Linkage Mode

Open ofergold opened this issue 9 years ago • 3 comments

Hi Yann,

According to the Duke documentation Duke provides a Record Linkage mode (I.e., linking records from two distinct datasets), as well as a Deduping mode (I.e., identifying duplicates within one dataset.)

In Duke, in order to utilize the Record Linkage mode, one would configure a mapping file to align fields from two datasets.

Does the plugin expose this functionality? I couldn’t find the a reference to such feature. Hopefully I just missed it :)

Thanks in advance!

ofergold avatar Jan 06 '16 00:01 ofergold

Hi Ofer,

No you did not miss it. I did not implement that. I might have a try one of these days.

Yann

YannBrrd avatar Jan 06 '16 09:01 YannBrrd

I would also love to see this feature. Possible to sponsor it?

kyrannian avatar Feb 12 '16 11:02 kyrannian

Hi,

What do you mean by "sponsor" ?

I'm not sure we should implement this within the plugin. I think at first glance that this is more linked to Duke usage in batch mode with Elasticsearch with a source...

YannBrrd avatar Feb 17 '16 08:02 YannBrrd