genie-toolkit
genie-toolkit copied to clipboard
Mistakenly tokenize numbers as phone numbers
In the current tokenization code, 6-digit number (e.g., 100000) is tokenized as PHONE_NUMBER_x
. We should simply normalize the potential phone numbers and copy them over - let the parser decide if it's a number or a phone number.