Simon Bedford
In `Pipeline.process_url` we make multiple calls to `article.update_status()`. The `update_status` method may raise `UnexpectedArticleStatusException` if the status appears to have been changed in the meantime. `process_url` should be prepared...
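One way `process_url` could guard those calls is sketched below. This is a minimal illustration, not the project's code: the `Article` class, the `expected` argument to `update_status`, the status names, and the choice to return a boolean are all assumptions made so the example is self-contained.

```python
class UnexpectedArticleStatusException(Exception):
    """Stand-in for the project's exception; the real class may differ."""


class Article:
    """Hypothetical minimal article with an optimistic status check."""

    def __init__(self, url, status="new"):
        self.url = url
        self.status = status

    def update_status(self, new_status, expected):
        # Refuse the update if the status changed since we last looked.
        # (The `expected` parameter is an assumption for this sketch.)
        if self.status != expected:
            raise UnexpectedArticleStatusException(
                f"expected {expected!r}, found {self.status!r}"
            )
        self.status = new_status


def process_url(article):
    """Return True if processing completed, False if it was abandoned."""
    try:
        article.update_status("fetching", expected="new")
        # ... fetching and parsing would happen here ...
        article.update_status("processed", expected="fetching")
    except UnexpectedArticleStatusException:
        # Something else touched the article; stop without overwriting its state.
        return False
    return True
```

Wrapping the whole sequence in a single `try` means any status conflict aborts processing at that step, leaving the article in whatever state the other party set.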
Make sure the pipeline works with PDF articles in different scenarios:
- Non-existent / broken URL
- Non-English
- Irrelevant
- Relevant

Ideally include some tests in `tests/test_Pipeline.py`.
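A table-driven layout for such tests might look like the sketch below. Everything in it is illustrative: `FakePipeline` is a hypothetical stand-in for the real `Pipeline`, `process_url` is assumed to return a final status, and the URLs and status strings are placeholders, not the project's actual values.

```python
import unittest


class FakePipeline:
    """Hypothetical stand-in; the real Pipeline fetches and classifies PDFs."""

    def __init__(self, canned_results):
        # Maps URL -> final status the fake should report.
        self.canned_results = canned_results

    def process_url(self, url):
        # URLs without a canned result are treated as fetch failures.
        return self.canned_results.get(url, "fetching failed")


# (url, expected final status); both columns are placeholders.
SCENARIOS = [
    ("http://example.com/missing.pdf", "fetching failed"),
    ("http://example.com/french.pdf", "not in English"),
    ("http://example.com/irrelevant.pdf", "irrelevant"),
    ("http://example.com/relevant.pdf", "processed"),
]


class TestPipelinePdf(unittest.TestCase):
    def setUp(self):
        # The missing URL is deliberately left out of the canned results.
        self.pipeline = FakePipeline(dict(SCENARIOS[1:]))

    def test_pdf_scenarios(self):
        for url, expected in SCENARIOS:
            # subTest reports each scenario separately on failure.
            with self.subTest(url=url):
                self.assertEqual(self.pipeline.process_url(url), expected)
```

In a real test file the fake would be replaced by the actual `Pipeline` with its network access stubbed out, so each scenario exercises the real classification logic.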
> In some contexts, information about IDPs is highly politicized, which could be problematic if you're drawing from media reports. You'd want to be very careful in selecting which sources...