recordlinkage icon indicating copy to clipboard operation
recordlinkage copied to clipboard

Add textdistance matching algorithms in recordlinkage compare string

Open rafmacalaba opened this issue 4 years ago • 4 comments

Hello,

I am currently using this module to do some record linking stuff, I am thinking of contributing some string matching algorithms that are implemented in textdistance, I'm currently using those already in my code by creating a modified String class and thought that this might be a good addition in the library.

Please inform me if you find this helpful and I am more than willing to implement it in this package.

Regards, Rafael

rafmacalaba avatar Jul 22 '20 10:07 rafmacalaba

Yes, please. I think they would do well as comparator classes. I forget what the default string comparison is.

On Wed, Jul 22, 2020 at 4:41 AM Rafael Macalaba [email protected] wrote:

Hello,

I am currently using this module to do some record linking stuff, I am thinking of contributing some string matching algorithms that are implemented in textdistance https://github.com/life4/textdistance, I'm currently using those already in my code by creating a modified String class and thought that this might be a good addition in the library.

Please inform me if you find this helpful and I am more than willing to implement it in this package.

Regards, Rafael

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/J535D165/recordlinkage/issues/146, or unsubscribe https://github.com/notifications/unsubscribe-auth/AN6QRTAWAFPD67OUXQIDTQLR4267JANCNFSM4PESJYIA .

-- Vincent Brandon Data Coordinator Utah Data Research Center 140 East 300 South | Salt Lake City, UT 84111 (801) 526-9705 [email protected]

utah-vabrandon avatar Jul 22 '20 13:07 utah-vabrandon

Thanks @frellnick, are you one of the contributors/approvers of pull request on this package?

rafmacalaba avatar Jul 22 '20 14:07 rafmacalaba

Nope. But I'd like so see it :)

On Wed, Jul 22, 2020 at 8:59 AM Rafael Macalaba [email protected] wrote:

Thanks @frellnick https://github.com/frellnick, are you one of the contributors/approvers of pull request on this package?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/J535D165/recordlinkage/issues/146#issuecomment-662503918, or unsubscribe https://github.com/notifications/unsubscribe-auth/AN6QRTH2CRL4BW3FQTOSBYDR435GFANCNFSM4PESJYIA .

-- Vincent Brandon Data Coordinator Utah Data Research Center 140 East 300 South | Salt Lake City, UT 84111 (801) 526-9705 [email protected]

utah-vabrandon avatar Jul 22 '20 15:07 utah-vabrandon

Thanks @frellnick !!

rafmacalaba avatar Jul 22 '20 15:07 rafmacalaba