OKR icon indicating copy to clipboard operation
OKR copied to clipboard

Berkeley dependency parser fails on punctuation

Open gabrielStanovsky opened this issue 7 years ago • 1 comments

Turns out that that Berkeley fails on sentences containing 3 subsequent '?'!' marks.
@kleinay overcame this problem by wrapping the single-sentence parsing stage with "try-except" block, logging and then ignoring sentences that it couldn't parse.
@kleinay, can you please post a problematic example sentence?

Thanks!

gabrielStanovsky avatar Aug 14 '17 12:08 gabrielStanovsky

for example (tweet 259284336363319296 from boy_scouts scenerio): "Over 1,000 Files Of Sex Abuse And The Boy Scouts Tried To Cover It Up?!?"

kleinay avatar Aug 14 '17 14:08 kleinay