streusle

Lexcat heuristics: PP false positives

Open nschneid opened this issue 6 years ago • 0 comments

Ken Litkowski noticed that PP is erroneously assigned as the lexcat for "in hope to", "just about", and "nothing but", all of which should be P. The cause is that the UPOS of the last word in these MWEs is PART, ADV, or CCONJ, whereas the current heuristics in lexcatter.py treat an MWE as P only if its last word is tagged ADP or SCONJ.
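To make the failure mode concrete, here is a minimal sketch of the rule as described above. It is not the actual code in lexcatter.py: the function name `heuristic_lexcat` and the constant `P_FINAL_UPOS` are hypothetical, and the sketch assumes the MWE has already been identified as adpositional, so the only remaining choice is between P and PP.

```python
# Hypothetical sketch of the rule described in this issue;
# NOT the real implementation in lexcatter.py.

# Last-word UPOS tags that, per the issue, cause an MWE to be labeled P.
P_FINAL_UPOS = {"ADP", "SCONJ"}


def heuristic_lexcat(last_upos: str) -> str:
    """Return 'P' or 'PP' for an adpositional MWE, given the UPOS of its last word.

    Assumes the caller has already decided the MWE is adpositional.
    """
    return "P" if last_upos in P_FINAL_UPOS else "PP"


if __name__ == "__main__":
    # Last-word tags the issue names as triggering the false positives.
    for tag in ("PART", "ADV", "CCONJ"):
        # Each of these falls through to PP, although P is the intended lexcat.
        print(tag, heuristic_lexcat(tag))
```

Under this reading, any adpositional MWE whose final word is not ADP or SCONJ falls through to PP, which is how the three expressions above end up mislabeled.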

nschneid, Mar 09 '18 03:03