Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.

Results 107 backend issues

One of the last big chunks of code that need porting to Python from Perl. (Split off of #679)

enhancement
python-rewrite

Aashka pointed out that in a multi-platform world they will want to know how many total matching posts there were, i.e. not just how many links were in the tweets, but...

enhancement
api

Some of our topics get too big (👀 elections). @hroberts added support for filtering for stories in a snapshot that have at least: * 1 inlink * 10 twitter shares...

enhancement
api

We are getting lots of download errors from malformed URLs. Some are fixable with more aggressive URL fixing of things like '>http://foo.bar'. Mostly the problem is feeds giving us relative...

I'm trying to debug some user support requests, but right now calls to story list are going super slow. The below log shows return times from 1 minute to 3...

question
api

The run_remotely() call to extract_and_vector from topics-fetch-link was mysteriously hanging occasionally, eventually causing the fetch link queue to shrink. I changed the run_remotely() call to an add_to_queue() call followed by...

> Now that we consume 3rd party APIs as part of our core pipeline, we need some system for monitoring them to find out when they are down. This should...

enhancement

Solr imports are very slow. Here are the last ten imports and the size of the import queue: `mediacloud=# select * from solr_imports order by solr_imports_id desc limit 10;`...

To support labelling attention charts in a topic with top stories (https://github.com/mitmedialab/MediaCloud-Web-Tools/issues/1659), we need to be able to search for the top story within an arbitrary date range. This means...

enhancement
api

The ["Africa Newsroom (Arabic)"](https://sources.mediacloud.org/#/sources/651063) source content isn't parsing correctly. Aashka's SDG topic found this one via spidering. For example, story #1307690431 ([original URL](https://tools.mediacloud.org/api/stories/1307701607/raw.html)) should have a bunch of about SDGs...

bug