internal-displacement
Studying news events and internal displacement.
Can we extract items such as the title and date published from a PDF?
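Pulling text out of a PDF needs a library (e.g. pdfminer or PyPDF2, not shown here); once we have the text, the publication date can often be recovered with a regex. A minimal sketch, where the date formats, patterns, and function name are all assumptions:

```python
import re
from datetime import datetime

MONTHS = (r'January|February|March|April|May|June|July|'
          r'August|September|October|November|December')

# Two common date formats as a starting point; real PDFs will need more.
DATE_PATTERNS = [
    (r'\b(\d{1,2} (?:%s) \d{4})\b' % MONTHS, '%d %B %Y'),   # 12 March 2017
    (r'\b((?:%s) \d{1,2}, \d{4})\b' % MONTHS, '%B %d, %Y'),  # March 12, 2017
]

def extract_date_published(text):
    """Return the first recognisable date in text (already extracted
    from the PDF), or None if nothing matches."""
    for pattern, fmt in DATE_PATTERNS:
        match = re.search(pattern, text)
        if match:
            return datetime.strptime(match.group(1), fmt).date()
    return None
```

A title heuristic (e.g. first non-empty line of the first page) could live alongside this, but is more fragile and layout-dependent.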
Take approach from `classification` notebook and integrate into interpreter for classification and filtering articles.
During scraping, can we tag whether something is text/video/image/pdf? Extra dessert if you can discern between news, blog, etc.
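One hedged way to do the media-type tagging: map the HTTP `Content-Type` header (which a scraper could fetch, e.g. via a HEAD request) to a coarse tag, falling back to guessing from the URL extension. The tag names and fallback behaviour here are assumptions:

```python
import mimetypes

def tag_media_type(url, content_type=None):
    """Return one of 'pdf', 'text', 'video', 'image', or 'unknown'.

    content_type is the raw HTTP Content-Type header if available;
    otherwise we guess from the URL's extension.
    """
    if content_type:
        # Strip parameters like '; charset=utf-8'
        content_type = content_type.split(';')[0].strip().lower()
    else:
        guessed, _ = mimetypes.guess_type(url)
        content_type = (guessed or '').lower()
    if content_type == 'application/pdf':
        return 'pdf'
    for prefix, tag in (('text/', 'text'), ('video/', 'video'),
                        ('image/', 'image')):
        if content_type.startswith(prefix):
            return tag
    return 'unknown'
```

Discriminating news vs. blog is a harder classification problem (domain lists, page structure) and probably belongs with the `classification` work rather than in the scraper.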
This code in `master` breaks production:

```
// if not using docker
// create a pgConfig.js file in the same directory and put your credentials there
const connectionObj = require('./pgConfig');
```
...
The `docker-compose.yml` and `docker.env` files are currently set up with local development in mind. We'll want a production-friendly config:
- Don't run localdb
- DB config refers to AWS RDS...
Write a function that calculates the percentage of missing fields in `report.Report` after an article has been interpreted. We may expand this later to include weighting or other factors. Discussion...
Here's a sketch of an infrastructure plan: ## Development Scrapers run locally (on developer machine) in Docker for prototyping (internal-displacement repo) Write to local DB in docker Can read scrape...
In `Pipeline.process_url` we make multiple calls to `article.update_status()`. The update_status method may raise `UnexpectedArticleStatusException` if it appears that the status has been changed in the meantime. `process_url` should be prepared...
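One way `process_url` might guard those calls is to funnel them through a small wrapper that catches the exception and signals that another worker has taken over the article. The class body and skip-on-conflict policy below are assumptions, not the repo's actual implementation:

```python
class UnexpectedArticleStatusException(Exception):
    """Raised by article.update_status() when the stored status has
    changed since we last read it (stand-in definition for this sketch)."""

def safe_update_status(article, new_status):
    """Attempt a status update; return False if the article's status
    was changed in the meantime (i.e. we should stop processing it)."""
    try:
        article.update_status(new_status)
        return True
    except UnexpectedArticleStatusException:
        # Another process appears to own this article now -- back off
        # rather than clobbering its state.
        return False
```

`process_url` could then bail out early whenever `safe_update_status` returns `False`, instead of letting the exception propagate mid-pipeline.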
Make sure the pipeline is working with PDF articles for different scenarios:
- Non-existent / broken URL
- Non-English
- Irrelevant
- Relevant

Ideally include some tests in `tests/test_Pipeline.py`.
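The four scenarios could look roughly like this in `tests/test_Pipeline.py`. `FakePipeline`, the URLs, and the status strings are all placeholders (assumptions) so the sketch is runnable; the real tests would exercise the actual `Pipeline` with fixture PDFs:

```python
class FakePipeline:
    """Stand-in for the real Pipeline so these scenarios run on their own."""
    def process_url(self, url):
        if 'broken' in url:
            return 'fetching failed'
        if 'french' in url:
            return 'language unsupported'
        if 'sports' in url:
            return 'irrelevant'
        return 'processed'

def test_broken_url():
    assert FakePipeline().process_url('http://example.com/broken.pdf') == 'fetching failed'

def test_non_english():
    assert FakePipeline().process_url('http://example.com/french-article.pdf') == 'language unsupported'

def test_irrelevant():
    assert FakePipeline().process_url('http://example.com/sports-story.pdf') == 'irrelevant'

def test_relevant():
    assert FakePipeline().process_url('http://example.com/displacement-report.pdf') == 'processed'
```

Plain `test_*` functions like these are picked up by pytest without any imports; parametrising over (url, expected_status) pairs would keep it compact as scenarios grow.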
Write a function in `article.Article` that calculates the percentage of scraped fields which are returned empty. We may consider expanding the definition of scraping reliability later, so suggestions welcome.