HebrewStopWords
HebrewStopWords copied to clipboard
List of hebrew stop words + script that computed them
HebrewStopWords
This is a list of the 500 most common words (stop words) computed from discussions from the Tapuz People website, on a variety of subjects.
Original corpora contained 1,397,173 tokes.
Tokens containing English characters or digits were removed from the lists.
heb_stopwords.txt - list of stopwords
heb_stopwords_counts.txt - list of stopwords + counts