pyner icon indicating copy to clipboard operation
pyner copied to clipboard

Timeout

Open Liontooth opened this issue 10 years ago • 2 comments

The stanford-ner server finds some strings unparsable. I have relatively dirty data with stray characters like BOM and NULL, not to mention non-US characters like ç, so pyner hangs. Could we have a timeout, perhaps in socket? I'm using ner.SocketNER and I'm surprised and delighted how fast it is - thank you!

Category wishlist

Cheers, Dave

Liontooth avatar Aug 10 '14 21:08 Liontooth

Sorry for the delayed response, but I've been busy with other commitments. Do you have a patch for that socket timeout by any chance? I'm happy to take a look and merge if it looks good.

dat avatar Sep 01 '14 23:09 dat

Thanks for the response!. Sorry, I don't know socket well enough.

Incidentally, I've used pyner for Stanford NLP Parts of Speech, changing NER to POS and it works fine.

Liontooth avatar Sep 02 '14 10:09 Liontooth