probablepeople

A Python library for parsing unstructured Western names into name components.

Results: 62 probablepeople issues

ORIGINAL STRING: Bianchette, Michael David
PARSED TOKENS: [('Bianchette,', 'Surname'), ('Michael', 'GivenName'), ('David', 'Surname')]
UNCERTAIN LABEL: Surname
When this error is raised, it's likely that either (1) the string is not...
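To illustrate why a parse like the one above ends in an "uncertain label" error, here is a minimal sketch of collapsing labeled tokens into one value per label. It is not the library's actual code: `RepeatedLabel` and `collapse_tokens` are hypothetical names, and the rule shown (fail when a label reappears after a different label has intervened) is an assumption based on the error output in this issue.

```python
from collections import OrderedDict


class RepeatedLabel(Exception):
    """Hypothetical stand-in for the error reported in the issue."""

    def __init__(self, label, tokens):
        super().__init__(f"label {label!r} repeats non-contiguously")
        self.label = label
        self.tokens = tokens


def collapse_tokens(parsed_tokens):
    """Merge (token, label) pairs into one string per label.

    Consecutive tokens with the same label are joined. A label that
    comes back after a different label has appeared is ambiguous, so
    the function raises instead of guessing.
    """
    tagged = OrderedDict()
    previous_label = None
    for token, label in parsed_tokens:
        if label == previous_label:
            tagged[label].append(token)   # same run, keep extending
        elif label not in tagged:
            tagged[label] = [token]       # first time this label is seen
        else:
            raise RepeatedLabel(label, parsed_tokens)
        previous_label = label
    return OrderedDict((label, " ".join(parts)) for label, parts in tagged.items())


# Tokens copied from the issue text above.
tokens = [("Bianchette,", "Surname"), ("Michael", "GivenName"), ("David", "Surname")]
try:
    collapse_tokens(tokens)
    uncertain = None
except RepeatedLabel as err:
    uncertain = err.label
```

Under these assumptions, 'Surname' is flagged because it appears, is interrupted by 'GivenName', and then appears again.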

Hi, the address "1624 Links Overlook Stone Mountain, GA 30088" is not parsing correctly. The street address should be: 1624 Links Overlook. The city should be: Stone Mountain...

Parsing the name "Fred Flintstone P.E" returns a corporation. Please also advise whether corrections are name-specific or format-specific ... this specific name represents a format that is not handled.

I'm tinkering with adding new training data, but it's unclear what the difference is between all of the Corporation tags, especially the first three: CorporationName, CorporationNameOrganization, CorporationNameAndCompany, CorporationNameBranchType, CorporationNameBranchIdentifier, CorporationCommitteeType...

I've been chasing down a bug I'm encountering in Dedupe (which uses probablepeople) and I traced it to this line: https://github.com/datamade/probablepeople/blob/672075cb23a86321d35b3b407b3f2d5e2dcadfa4/probablepeople/__init__.py#L141 For some reason, the name "12society" is identified as...

ORIGINAL STRING: MARVA ROSE ELAINE BAKER
PARSED TOKENS: [('MARVA', 'GivenName'), ('ROSE', 'Surname'), ('ELAINE', 'GivenName'), ('BAKER', 'Surname')]
UNCERTAIN LABEL: GivenName

PARSED TOKENS: [('HULL', 'Surname'), ('RICHARD', 'GivenName'), ('A', 'MiddleInitial'), ('III', 'Surname')]
'III' should be recognized as a generational suffix.
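Until the model handles this case, one possible workaround is to relabel a trailing generational suffix after parsing. The sketch below is only an illustration: `relabel_generational_suffix` and the `GENERATIONAL_SUFFIXES` set are hypothetical, and the label name 'SuffixGenerational' is assumed to match the library's label for generational suffixes.

```python
# Illustrative, deliberately small set; extend as needed.
GENERATIONAL_SUFFIXES = {"JR", "SR", "II", "III", "IV"}


def relabel_generational_suffix(parsed_tokens):
    """Return a copy of the tokens with a trailing suffix relabeled.

    Only the last token is considered, and only when it is currently
    labeled 'Surname' and at least one other token precedes it, so a
    lone token is never rewritten.
    """
    fixed = list(parsed_tokens)
    if len(fixed) < 2:
        return fixed
    token, label = fixed[-1]
    normalized = token.strip(".,").upper()
    if label == "Surname" and normalized in GENERATIONAL_SUFFIXES:
        # Assumed label name for generational suffixes.
        fixed[-1] = (token, "SuffixGenerational")
    return fixed


# Tokens copied from the issue text above.
tokens = [("HULL", "Surname"), ("RICHARD", "GivenName"),
          ("A", "MiddleInitial"), ("III", "Surname")]
fixed = relabel_generational_suffix(tokens)
```

This only patches the output. It does not change what the model has learned, so adding labeled training examples would still be the more durable fix.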

Otherwise, this is not there by default. Signed-off-by: Philippe Ombredanne