Jeff Zemerick
Jeff Zemerick
OpenNLP SentenceModel usage should include a sentence model.
How to handle spelled out numbers in place of digits in the filters that use regular expressions with digits?
Adaptive confidence threshold calculations should be distributed. The DescriptiveStatistics is local. In an instance where multiple instances are running, each instance will have its own calculations and that's not ideal.
Create medical abbreviation filter to identify false positives. One possible list of abbreviations: [The BioText Project ](https://biotext.berkeley.edu/data.html)
Ignore span based on whitespace -- Ignore "George Washington" what about "George Washington"
Add OR boolean operator to grammar. Currently, OR can be accomplished to some degree by using multiple filter strategies. It would be ideal to allow expressions like: ``` context ==...
Rename CRYPTO_REPLACE to AESSIV_REPLACE. This also means updating the encryption algorithm to use AES-SIV instead.
Add an option to the Persons filter to also look for titles and suffixes.
Add an optional parameter to the FirstName filter that requires a Surname immediately after. Likewise, add an optional parameter to the Surname filter that requires a FirstName immediately preceding it....
Ignore cities when they appear as part of a court name, e.g. District Court of Baltimore City. This requires consideration about where to implement the feature. If we are looking...