twitter-corpus
twitter-corpus copied to clipboard
Collects all tweets from the sample Public stream using Twitter's streaming API, and saves them to a file for later use as a corpus.
The 1.11 version doesn't work anymore. After updating it to 3.5.0, the script worked.
Not currently working out of the box due to Tweepy updates. The version of Tweepy in [requirements.txt](https://github.com/bwbaugh/twitter-corpus/blob/c5368852fbb43034e9d8950ce53f8571d4d263b3/requirements.txt#L4) is `1.11` which [uses the old twitter v1 endpoint](https://github.com/tweepy/tweepy/blob/80e5edd61b207e4f15de11a79474972489c56247/tweepy/streaming.py#L17). The current version is...
We should allow some of the options that are currently hard-coded into the Python script (or potentially in a configuration file) to be specified on the command line instead (potentially...