Rahul Bhargava
On the front end we have a small wrapper around this API, which could be built on or repurposed to support this plugin. Relevant code is in [`pushshift/reddit.py`](https://github.com/mitmedialab/MediaCloud-Web-Tools/blob/topics-multi-platform/server/util/pushshift/reddit.py#L129) in the...
Do we know how they are dealing with the tweet deletion problem? I.e., if a user deletes a tweet later, does it disappear from archive.org?
Glad this list feels like a good start. I think #2 has been fairly validated as useful too (see @cindyloo's repo [MediaCloud-Image-Tests](https://github.com/mitmedialab/MediaCloud-Image-Tests)). I think you're right that this argues for...
Yeah, I know. I was trying to not get into too much detail, but also include a bunch. So I think I've ended up with a rather arbitrary list of the...
I took another pass at adding in more of the features of the full topic mapper engine. Give this one a look over for errors/omissions. [MC Topic Creation Dataflow-2.pdf](https://github.com/berkmancenter/mediacloud/files/4772883/MC.Topic.Creation.Dataflow-2.pdf)
Thx 👍🏽 I'm gonna share it with the rest of the Civic MC team to get feedback. After another rev or two it'll be ready to add into the repo...
@hroberts - where is the code for the spidering engine?
More confusingly, asking to page with more than 100 rows seems to make the story_tags disappear from the results. This code returns a story 105831 with story_tags on it: ```python...
Note: these show up as some of the top words in our system if you search for everything since the beginning of time without using a language filter... not a...
More notes on the related project board: https://github.com/berkmancenter/mediacloud/projects/3