Max Harlow

Results 19 comments of Max Harlow

Also match parallelisation?

https://github.com/rhasspy/rapidfuzz

https://github.com/moj-analytical-services/splink

Or Numba? https://github.com/numba/numba

@lsdh thanks! Glad you've found it useful. To clarify -- by matching line numbers do you mean so that ordering matters?

* Jaro-Winkler * Q-gram Sources: * http://www.cs.cmu.edu/~wcohen/postscript/ijcai-ws-2003.pdf * https://github.com/JohnnyBravo75/TwinFinder * http://yomguithereal.github.io/talisman/metrics/distance

Includes Jaro as of 1.14

Locality sensitive hashing? Was used here: https://artificialinformer.com/issue-one/dissecting-a-machine-learning-powered-investigation.html

Fellegi-Sunter? https://github.com/moj-analytical-services/sparklink