usaddress icon indicating copy to clipboard operation
usaddress copied to clipboard

RepeatedLabelError:

Open thogarus opened this issue 1 year ago • 2 comments

RepeatedLabelError: ERROR: Unable to tag this string because more than one area of the string has the same label

ORIGINAL STRING: 1449 Engineering Research Ct B100 East Lansing MI 48824-1226 PARSED TOKENS: [('1449', 'AddressNumber'), ('Engineering', 'StreetName'), ('Research', 'PlaceName'), ('Ct', 'StateName'), ('B100', 'StreetName'), ('East', 'StreetNamePostDirectional'), ('Lansing', 'PlaceName'), ('MI', 'StateName'), ('48824-1226', 'ZipCode')] UNCERTAIN LABEL: StreetName

When this error is raised, it's likely that either (1) the string is not a valid person/corporation name or (2) some tokens were labeled incorrectly

To report an error in labeling a valid name, open an issue at https://github.com/datamade/usaddress/issues/new - it'll help us continue to improve probablepeople!

This is with the usaddress-0.5.10

thogarus avatar Aug 14 '24 15:08 thogarus

Ah, yea this one does seem to be a bit different! How are you expecting this to be labelled? I'm particularly curious about the B100 part, since the rest seems to make sense. I'm thinking something like:

Tag Address Part
AddressNumber 1449
StreetName Engineering Research
StreetNamePostType Ct
??? B100
PlaceName East Lansing
StateName MI
ZipCode 48824-1226

Also, do you have any more examples of this type of address that you could share with us?

xmedr avatar Apr 03 '25 22:04 xmedr

We've since got some more updates into the package. What do you think of this parsing:

Address part Tag
1449 AddressNumber
Engineering Research StreetName
Ct StreetNamePostType
B100 OccupancyIdentifier
East Lansing PlaceName
MI StateName
48824-1226 ZipCode

xmedr avatar Jun 11 '25 20:06 xmedr