backend icon indicating copy to clipboard operation
backend copied to clipboard

Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.

Results 107 backend issues
Sort by recently updated
recently updated
newest added

Capturing a request from researchers: for a story discovered on Reddit, they'd like to see which subreddits it was found in. From a system point of view, this means they...

enhancement
api

This came up again with researchers - they want to know which platform a story was first found on in a topic. My old write-up: " We need to be...

enhancement

add topics/cancel end point to end any currently running spider or snapshot jobs for a topic.

It looks like the tags_id partition is not getting created on new stories_tags_map partitions. I have started a manual script to create the missing indexes, but I think the underlying...

@hroberts describes this as: "adding per platform post and author counts for multi-platform seeded content so you can sort stories in the topic by twitter, reddit, fb, etc share count"

enhancement

Now that we have automated map generation, the next step is to make them more flexible. @hroberts describes this as: "adding generation options for these maps so that you can...

enhancement
api

Me: > I think the crawler fetches quite a few (most? all?) downloads twice: fetchers constantly complain with *Object ID %d already exists, will store a new version or overwrite...

bug

One of the last big chunks of code that need porting to Python from Perl. (Split off of #679)

enhancement
python-rewrite

When we find a YouTube video in a topic, we should parse out the channel name and create that as a YT source that you can aggregate by.

enhancement