Friedrich Lindenberg
Heuristics:

* LOC -> ANY
* LOC -> LOC
* Observers

How about this as a first attempt: https://sim51.github.io/react-sigma/docs/example/load-graph
## Progress

StoryWeb can now load articles, run them through spaCy NER and store the extracted entity tags (e.g. `John Doe`) to a database. In that database, each tag is...
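
The storage step could look roughly like the sketch below. It assumes the tags have already been produced by spaCy as `(name, label)` pairs. The `tag` table, its columns, and the `store_tags` helper are hypothetical and do not reflect StoryWeb's actual schema.

```python
import sqlite3


def store_tags(conn, article_id, tags):
    """Insert (name, label) pairs for one article into a hypothetical table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS tag (article_id TEXT, label TEXT, name TEXT)"
    )
    conn.executemany(
        "INSERT INTO tag (article_id, label, name) VALUES (?, ?, ?)",
        [(article_id, label, name) for name, label in tags],
    )
    conn.commit()


# In-memory database so the sketch runs without any setup.
conn = sqlite3.connect(":memory:")
store_tags(conn, "article-1", [("John Doe", "PERSON"), ("OCCRP", "ORG")])
rows = conn.execute("SELECT label, name FROM tag ORDER BY name").fetchall()
```
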

* https://occrp.org
* https://www.icij.org/
* https://dossier.center/
* US DOJ - https://www.justice.gov/usao/pressreleases

Very odd, because there are no magic characters here:
To make tesseract happy.
The `upload_file` method uses a faulty snippet of ActiveState code that will choke on any non-ASCII (or, god forbid, binary) content. I'm using `requests.post()` as a workaround for now, but...
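
A minimal sketch of a `requests`-based upload along the lines of the workaround mentioned above, assuming a generic multipart endpoint. The URL, the `file` field name, and the `build_upload` helper are hypothetical and not taken from the original client. The request is only prepared, not sent, so the encoding can be inspected offline.

```python
import requests

# Hypothetical endpoint; substitute the real upload URL.
UPLOAD_URL = "https://example.org/upload"


def build_upload(filename: str, payload: bytes) -> requests.PreparedRequest:
    """Prepare a multipart POST that carries arbitrary bytes unchanged."""
    # requests builds the multipart body as bytes, so non-ASCII and
    # binary payloads are not pushed through a text codec.
    files = {"file": (filename, payload, "application/octet-stream")}
    return requests.Request("POST", UPLOAD_URL, files=files).prepare()


# Binary content containing bytes that are not valid ASCII.
prepared = build_upload("scan.bin", b"\x00\xff\xfe caf\xc3\xa9")

# To actually send it: requests.Session().send(prepared)
```
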
```
Traceback (most recent call last):
  File "/opensanctions/zavod/zavod/crawl.py", line 34, in crawl_dataset
    entry_point(context)
  File "/opensanctions/datasets/cz/business_register/crawler.py", line 109, in crawl
    parse_xml(context, res)
  File "/opensanctions/datasets/cz/business_register/crawler.py", line 64, in parse_xml
    tree = etree.parse(reader)
...
```
Collecting these here for now to be introduced in batch with significant warning time:

* Introduce `Image` entities in practice.
* Introduce a `jurisdiction` property type, different from `country` (e.g....