node-stopwords
node-stopwords copied to clipboard
English: Apostrophed stopwords
Hi,
I noticed that all stopwords which normally have apostrophes are present in the stopwords list without the apostrophes.
For instance, hasnt is there, but hasn't is not.
Is this standard practice?
Because this leads to some artifacts, for example hell is a stopword instead of he'll. So hell would incorrectly be identified as a stopword. Similarly shell is also incorrectly identified as a stopword...
Just wanted to know the reason behind this design choice