tweetokenize

Tokenization and pre-processing for Twitter data used to train classifiers.

Results 3 tweetokenize issues

Command: `python.exe -m pip install https://testpypi.python.org/packages/e0/d3/85ec655e244873a487243a80e2c0b248a3c91d6d5b3d2e792715606684c7/tweetokenize-1.0.1.tar.gz`

```
Collecting https://testpypi.python.org/packages/e0/d3/85ec655e244873a487243a80e2c0b248a3c91d6d5b3d2e792715606684c7/tweetokenize-1.0.1.tar.gz
  Downloading https://testpypi.python.org/packages/e0/d3/85ec655e244873a487243a80e2c0b248a3c91d6d5b3d2e792715606684c7/tweetokenize-1.0.1.tar.gz
    Complete output from command python setup.py egg_info:
    Traceback (most recent call last):
      File "<string>", line 1, in <module>
      File "C:\Users\Edward\AppData\Local\Temp\pip-cn7l8kbc-build\setup.py",...
```

Line 87, `ur"|".join(`, fails with `SyntaxError: invalid syntax`. Sorry, worked out this is Python 2. Any plans for making this Python 3 compatible?
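For context on the error: the `ur"..."` string prefix is accepted only by Python 2, which is why Python 3 stops at that line. In Python 3 every `str` is already Unicode, so a plain raw string `r"..."` expresses the same thing. The sketch below shows what a ported join might look like; the pattern fragments and the `find_entities` name are illustrative assumptions, not the package's actual code.

```python
import re

# Hypothetical pattern fragments for illustration only; the package's real
# patterns are not shown in the issue.
FRAGMENTS = [
    r"https?://\S+",  # URLs
    r"@\w+",          # user mentions
    r"#\w+",          # hashtags
]

# Python 2 spelling: ur"|".join(...)
# Python 3 spelling: r"|".join(...), since str is Unicode by default.
PATTERN = re.compile(r"|".join(FRAGMENTS))


def find_entities(text):
    """Return every URL, mention, or hashtag found in text, in order."""
    return PATTERN.findall(text)
```

Calling `find_entities("hi @bob see http://x.co #fun")` under these assumed patterns returns the mention, the URL, and the hashtag in the order they appear.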

Keeping the case of the tweet intact helps (`lowercase=False`), but when ignoring stopwords I would think the comparison should be case-insensitive. What do you think...
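One way to get the behaviour this issue asks for is to compare tokens against the stopword list in case-folded form while returning the tokens unchanged. This is a minimal standalone sketch, not the package's API: the `remove_stopwords` name and its arguments are assumptions made for illustration.

```python
def remove_stopwords(tokens, stopwords):
    """Drop stopwords regardless of case, keeping the original case of the rest.

    Hypothetical helper: tokens is a list of strings as a tokenizer might
    return them with lowercasing disabled; stopwords is any iterable of words.
    """
    # Fold the stopword list once so each lookup is a cheap set membership test.
    folded = {word.casefold() for word in stopwords}
    # Compare on the folded form, but emit the token exactly as it came in.
    return [token for token in tokens if token.casefold() not in folded]
```

For example, `remove_stopwords(["The", "Cat", "IS", "here"], ["the", "is"])` keeps `"Cat"` and `"here"` with their original capitalisation.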