pyner
pyner copied to clipboard
Timeout
The stanford-ner server finds some strings unparsable. I have relatively dirty data with stray characters like BOM and NULL, not to mention non-US characters like ç, so pyner hangs. Could we have a timeout, perhaps in socket? I'm using ner.SocketNER and I'm surprised and delighted how fast it is - thank you!
Category wishlist
Cheers, Dave
Sorry for the delayed response, but I've been busy with other commitments. Do you have a patch for that socket timeout by any chance? I'm happy to take a look and merge if it looks good.
Thanks for the response!. Sorry, I don't know socket well enough.
Incidentally, I've used pyner for Stanford NLP Parts of Speech, changing NER to POS and it works fine.