scrapers
scrapers copied to clipboard
Code relating to scraping public police data.
Here's an image of the issue: [ data:image/s3,"s3://crabby-images/ba456/ba4566d884bc094f75da9dbf0772e47b33ee2e95" alt="image" ](url) This was acquired when attempting to install requirements.txt on a fresh remote repo
Part of our official full-time work on PDAP has been making sure we're on the same page about terminology—one thing we are going to change is `dataset` → `data source`....
Bumps [scrapy](https://github.com/scrapy/scrapy) from 2.5.1 to 2.6.2. Release notes Sourced from scrapy's releases. 2.6.2 Fixes a security issue around HTTP proxy usage, and addresses a few regressions introduced in Scrapy 2.6.0....
Bumps [scrapy](https://github.com/scrapy/scrapy) from 2.5.1 to 2.6.2. Release notes Sourced from scrapy's releases. 2.6.2 Fixes a security issue around HTTP proxy usage, and addresses a few regressions introduced in Scrapy 2.6.0....
# Feature Configure a scraper to the new paradigm from https://github.com/Police-Data-Accessibility-Project/planning/issues/146. Doing this with any scraper will close the issue. # Candidates ## Minneapolis https://github.com/Police-Data-Accessibility-Project/PDAP-Scrapers/pull/111 Then, we should run this...
>A process which, when run, submits a scraper’s Extraction and metadata to our database. For now, we're going to use CKAN instead of making our own API from scratch. #...
This is a late-stage project, because it relies on so many blockers. - [ ] https://github.com/Police-Data-Accessibility-Project/PDAP-Scrapers/issues/139 - [ ] https://github.com/Police-Data-Accessibility-Project/PDAP-Scrapers/issues/141 - [ ] submit the results to intake #80 -...
## Task: - [ ] Make a python module that takes the output of a scraper and submits it to our CKAN instance. This can be a pretty informal experiment....
# Request We have a user in Pittsburgh looking for data on K-9 use and training. We should extract all the Pittsburgh police datasets and comb through them for potential...
Document this in [the docs](docs.pdap.io), and point there in the readme of this repo. 1. install python, 2. run this scraper, 3. etc