node-stopwords icon indicating copy to clipboard operation
node-stopwords copied to clipboard

English: Apostrophed stopwords

Open ccpandhare opened this issue 5 years ago • 0 comments

Hi,

I noticed that all stopwords which normally have apostrophes are present in the stopwords list without the apostrophes.

For instance, hasnt is there, but hasn't is not. Is this standard practice?

Because this leads to some artifacts, for example hell is a stopword instead of he'll. So hell would incorrectly be identified as a stopword. Similarly shell is also incorrectly identified as a stopword...

Just wanted to know the reason behind this design choice

ccpandhare avatar Jul 16 '20 09:07 ccpandhare