Media Cloud

Results: 11 repositories owned by Media Cloud

ultimate-sitemap-parser

Stars: 171 · Forks: 64

Ultimate Website Sitemap Parser

backend

Stars: 276 · Forks: 87

Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online media.

sentence-splitter

Stars: 218 · Forks: 29

Text-to-sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.
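
To give a rough idea of what heuristic sentence splitting involves, here is a minimal sketch. It is not the sentence-splitter package's API or the Koehn and Schroeder algorithm itself; the `split_sentences` function and the tiny `NON_BREAKING_PREFIXES` set are hypothetical stand-ins that only show the general idea of refusing to break after known abbreviations.

```python
import re

# Hypothetical, deliberately tiny abbreviation list for illustration only.
NON_BREAKING_PREFIXES = {"Mr", "Mrs", "Dr", "Prof", "vs", "e.g", "i.e"}


def split_sentences(text):
    """Split text at ., ! or ? when followed by whitespace and a capital letter,
    unless the period ends a known abbreviation."""
    sentences = []
    start = 0
    # Candidate boundary: end punctuation, whitespace, then an uppercase letter.
    for match in re.finditer(r"[.!?]\s+(?=[A-Z])", text):
        words = text[start:match.start()].split()
        last_word = words[-1] if words else ""
        # Skip periods that terminate an abbreviation such as "Dr."
        if text[match.start()] == "." and last_word in NON_BREAKING_PREFIXES:
            continue
        sentences.append(text[start:match.start() + 1].strip())
        start = match.end()
    # Whatever remains after the last boundary is the final sentence.
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences
```

Calling `split_sentences("Dr. Smith arrived. He was late!")` keeps "Dr. Smith arrived." together as one sentence because "Dr" is in the abbreviation set.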

cliff-annotator

Stars: 119 · Forks: 34

A lightweight server that accepts HTTP requests for the Stanford Named Entity Recognizer and a heavily modified CLAVIN geoparser.

web-tools

Stars: 63 · Forks: 31

The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)

api-client

Stars: 64 · Forks: 24

Public client for consuming content from the Media Cloud Online News Archive & Directory.

api-tutorial-notebooks

Stars: 28 · Forks: 12

A set of Jupyter notebooks demonstrating how to use the Media Cloud API.

cliff-docker

Stars: 15 · Forks: 9

A Docker image for the CLIFF geolocation software.

date_guesser

Stars: 42 · Forks: 7

A library to extract a publication date from a web page, along with a measure of the accuracy.

feed_seeker

Stars: 30 · Forks: 12

Find RSS, Atom, XML, and RDF feeds on web pages.
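
As an illustration of one common way to find feeds on a web page, the sketch below scans HTML for `<link rel="alternate">` tags that advertise a feed MIME type. It uses only the Python standard library and is not feed_seeker's API; `FeedLinkParser`, `find_feeds`, and the `FEED_TYPES` set are hypothetical names chosen for this example, and the real project may use other strategies.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# MIME types assumed here to indicate a feed; an illustrative choice.
FEED_TYPES = {
    "application/rss+xml",
    "application/atom+xml",
    "application/rdf+xml",
    "application/xml",
    "text/xml",
}


class FeedLinkParser(HTMLParser):
    """Collect feed URLs advertised through <link rel="alternate"> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        # Attribute values can be None for bare attributes; normalize to "".
        attr = {name: (value or "") for name, value in attrs}
        rel_tokens = attr.get("rel", "").lower().split()
        mime_type = attr.get("type", "").lower()
        href = attr.get("href", "")
        if "alternate" in rel_tokens and mime_type in FEED_TYPES and href:
            # Resolve relative links against the page URL.
            self.feeds.append(urljoin(self.base_url, href))


def find_feeds(html, base_url):
    """Return absolute feed URLs declared in the given HTML document."""
    parser = FeedLinkParser(base_url)
    parser.feed(html)
    return parser.feeds
```

Given a page at `https://example.com/news/` whose head contains `<link rel="alternate" type="application/rss+xml" href="/feed.xml">`, `find_feeds` would return the absolute URL `https://example.com/feed.xml`.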