covid-data-pipeline
covid-data-pipeline copied to clipboard
Resources for what data is avaliable for each WA county
Hey I am not sure what the best place to coordinate on pulling in new data, but I thought I'd take a survey of what data is available in each WA county as of time of writing.
Only 8 are currently reporting negative cases, 2 reporting hospitalization & 1 reporting recoveries.
https://docs.google.com/spreadsheets/d/1CS-jkQ7olNC1N0pi43fYsqNnGhq5jEBjHE0i1fhttCg/edit?usp=sharing
If y'all are planning on diving down to the county level I'm happy to work on any of them, but I care most about getting WA, GA, CA, TX & NJ up.
If you want to add this as a data source:
- checkout the repo and switch to the dev branch.
- look at the instructions in url_source_parser.py
- add a parser for your worksheet. SheetParser probably get the data into a frame. (but I plan to switch to the official sheets API soonish)
- run test\test_sources.py to see if it works.
Or if you don't want to mess with python, I'll add it tonight.
I haven't really faced up to the need for a real DB yet. I suspect that I'll wind up storing this data in a managed instance on DO if they come through with credits.
Hi @placeybordeaux, Also check out https://github.com/lazd/coronadatascraper/ - some county scraping is also happening with those folks.