lemmeknow icon indicating copy to clipboard operation
lemmeknow copied to clipboard

Phone number regex problem

Open cartesianlab opened this issue 3 years ago • 1 comments

Detecting a phone number for any country is no easy task. Some Python libs are doing a great job at it using mix of regex and Machine Learning. Current implementation in lemmeknow is creating lot of obvious false positives:

image

cartesianlab avatar Sep 29 '22 08:09 cartesianlab

Boundaryless mode is on by default for lemmeknow binary, that is why it identified the phone number, if you add -b flag to turn it off, you will see the results only if the whole text matched.

swanandx avatar Sep 29 '22 10:09 swanandx