backend
Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.
Related PR: https://github.com/mediacloud/systems/pull/229; original issue: https://github.com/mediacloud/backend/issues/794. @pypt, do you think we need to pass the new Docker Hub credentials as args in the dev scripts, along the lines of what...
>> Does adding select insert_platform_source_pair( 'twitter', 'brandwatch' ); here in mediawords.sql entail a migration, and should I also delete select insert_platform_source_pair( 'twitter', 'crimson_hexagon' );? > I think it does -...
NYT offers an API endpoint that suggests matching entities from their list of ~15k people and ~7k orgs. I wonder if we could use this to build a simple substitution-based...
The commercial https://newsapi.org site appears to have ~30k sources categorized by country of publication and language. Might be worth scraping from their `sources` endpoint and importing that metadata for any...
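A minimal sketch of the import step suggested above, assuming the `sources` response is a JSON object with a `sources` list whose entries carry `name`, `country`, and `language` fields. The field names, the `group_sources` helper, and the sample payload are illustrative assumptions, not code from this repository, and no network call is made.

```python
from collections import defaultdict


def group_sources(payload):
    """Group source names by (country, language).

    `payload` is assumed to be the already-parsed JSON body of a
    `sources` response; entries missing a field are grouped under None.
    """
    grouped = defaultdict(list)
    for source in payload.get("sources", []):
        key = (source.get("country"), source.get("language"))
        grouped[key].append(source.get("name"))
    return dict(grouped)


# Hypothetical sample payload, made up for illustration only.
sample = {
    "status": "ok",
    "sources": [
        {"name": "Example One", "url": "https://one.example", "country": "us", "language": "en"},
        {"name": "Example Two", "url": "https://two.example", "country": "de", "language": "de"},
        {"name": "Example Three", "url": "https://three.example", "country": "us", "language": "en"},
    ],
}

grouped = group_sources(sample)
for (country, language), names in sorted(grouped.items()):
    print(country, language, names)
```

Matching these grouped entries against existing media sources (for example by normalized URL) would be a separate step that this sketch does not attempt.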
Rewrite the API from the ground up. Some goals:
* Python instead of Perl
* use an API generator to help?
* better error reporting
* more RESTful (placeholder issue...
A collaborator reports that doing a story list with `sort=mc.SORT_PUBLISH_DATE_DESC` doesn't work. Sorting by `random` and by `processed_stories_id` works fine. Can you check whether this works and is still...
At present, we sometimes find ourselves unable to build certain containers because our [`base`](https://github.com/mediacloud/backend/blob/master/apps/base/Dockerfile) image doesn't change much, and consequently isn't rebuilt very often. That means that the `apt` package...
Via @ebndulue: As part of our project to identify when preprint server URLs are linked to in news, we ran a topic for all stories (so, a * query)...
This is the migration of the topics-mine and topics-mine-public workers from Perl to Python. It is almost entirely a line-by-line migration, with only a few small fixes...