crawlee icon indicating copy to clipboard operation
crawlee copied to clipboard

utils.social phonesFromText issues with slashes

Open bwundo opened this issue 6 years ago • 2 comments

On pages with phone numbers formatted like this: 0175/234234, 0160/345345 and +49151/456456

It just adds 234234, 345345 and 456456 to the result set "phonesUncertain".

bwundo avatar Aug 01 '19 18:08 bwundo

cc @jancurn

mnmkng avatar Aug 21 '19 08:08 mnmkng

Indeed, slashes are currently not recognized. PRs welcome.

On a related note, phone number detection using regular expressions is rather poor, to make it work properly, we'd need to use some AI extractor.

jancurn avatar Aug 21 '19 09:08 jancurn