Forest Gregg
It should be possible (see https://stackoverflow.com/questions/18313818/how-to-not-load-the-comments-while-parsing-xml-in-lxml), but it will need to be changed in parserator.
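A minimal sketch of the idea in the linked answer, assuming lxml is doing the parsing: `etree.XMLParser(remove_comments=True)` drops comment nodes at parse time. The sample XML, its element names, and the `parse_without_comments` helper are hypothetical, and where this would hook into parserator is not shown here.

```python
from lxml import etree

# Hypothetical labeled data containing an annotator's comment.
XML = b"""<Collection>
  <!-- a note left by an annotator -->
  <Name><GivenName>Mary</GivenName> <Surname>Smith</Surname></Name>
</Collection>"""


def parse_without_comments(xml_bytes):
    """Parse XML with lxml, discarding comment nodes while parsing."""
    parser = etree.XMLParser(remove_comments=True)
    return etree.fromstring(xml_bytes, parser)


def comment_nodes(root):
    """Return every comment node in the tree rooted at `root`."""
    return [node for node in root.iter() if node.tag is etree.Comment]


# Default parser keeps the comment; the configured parser drops it.
root_default = etree.fromstring(XML)
root_clean = parse_without_comments(XML)

print(len(comment_nodes(root_default)), len(comment_nodes(root_clean)))
```

With the comments gone, code that iterates over child nodes no longer has to special-case them.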
Please upgrade pip and try again: `pip install --upgrade pip`. pip shouldn't have tried to build the wheel; it should have used the Windows wheel from https://pypi.python.org/pypi/DoubleMetaphone. There might also be some...
- 1. is clearly an error
- How do you think that 2. should be labeled?
- How do you think that 3. should be labeled?
- 4. is clearly...
1. Seems to be a common pattern in your data, but is not a pattern I've really seen anywhere else. There are different entities here, a business and a person....
1. Issue one. Agree with your proposal.
2. If someone is called "Mary Ann", then Mary Ann is their given name. The concept of the middle name is... difficult. https://en.wikipedia.org/wiki/Spanish_naming_customs...
Yes, please make a PR!
This looks great. Could you provide more descriptive file names?
female.xml, my_labeled.xml, my_train.xml don't really help me understand what's in these files. Nor do I understand why these are in three files instead of one file.
Thanks, @jonquandt. Usually, but not always? There are cases where there are in-process packages but no final packages?
@jonquandt thanks so much for explaining about the in-process packages. Here are apparent duplicates that do not fit the in-process, final version pattern.

```python
['https://api.govinfo.gov/packages/CHRG-111hhrg54476/mods', 'https://api.govinfo.gov/packages/CHRG-111hhrg55059/mods'],
['https://api.govinfo.gov/packages/CHRG-112hhrg65734/mods', 'https://api.govinfo.gov/packages/CHRG-112hhrg66317/mods'],
['https://api.govinfo.gov/packages/CHRG-112hhrg73291/mods', 'https://api.govinfo.gov/packages/CHRG-112jhrg73291/mods'],
...
```