pattern
pattern copied to clipboard
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Hi all, I want to work on arabic language support. Any advice on how I should start ?
Take the verb 'were' for example. If I do `tenses('were')` the list of possible tenses includes `('past', 3, 'singular', 'indicative', 'imperfective')`. Certainly though, ```He were correct.``` is incorrect! It should...
singularize('rookies') should be 'rookie' etc. for all the words on singular_ie This fixes clips/pattern#227
All of the words in `singular_ie` inside of inflect.py aren't being singularized properly. The code in question is ``` for x in singular_ie: if w.endswith(x+"s"): return w ``` Instead of...
I wrote a Python script to compare the list of stopwords currently used by pattern's [vector module](https://github.com/clips/pattern/blob/master/pattern/vector/stopwords-en.txt) against other popular stopword lists to check whether an update is required. A...
I wrote a Python script to check the output of pattern's implementation of the Porter2 stemmer (in the vector [module](https://github.com/clips/pattern/tree/master/pattern/vector)) against the output of the original [implementation](http://snowball.tartarus.org/algorithms/english/stemmer.html) by Martin Porter....
Since 2017, twitter support 280 characters. But for compatibility reason, all texts are truncated to 140 characters by default (with a "..." at end of statuses). To be able to...

Hi, In all docus on the internet, it says: pip install pattern from pattern. de import tag, etc. but it gave an undefined message if I do it like this....