covid-data-pipeline icon indicating copy to clipboard operation
covid-data-pipeline copied to clipboard

Resources for what data is avaliable for each WA county

Open placeybordeaux opened this issue 4 years ago • 2 comments

Hey I am not sure what the best place to coordinate on pulling in new data, but I thought I'd take a survey of what data is available in each WA county as of time of writing.

Only 8 are currently reporting negative cases, 2 reporting hospitalization & 1 reporting recoveries.

https://docs.google.com/spreadsheets/d/1CS-jkQ7olNC1N0pi43fYsqNnGhq5jEBjHE0i1fhttCg/edit?usp=sharing

If y'all are planning on diving down to the county level I'm happy to work on any of them, but I care most about getting WA, GA, CA, TX & NJ up.

placeybordeaux avatar Mar 20 '20 20:03 placeybordeaux

If you want to add this as a data source:

  1. checkout the repo and switch to the dev branch.
  2. look at the instructions in url_source_parser.py
  3. add a parser for your worksheet. SheetParser probably get the data into a frame. (but I plan to switch to the official sheets API soonish)
  4. run test\test_sources.py to see if it works.

Or if you don't want to mess with python, I'll add it tonight.

I haven't really faced up to the need for a real DB yet. I suspect that I'll wind up storing this data in a managed instance on DO if they come through with credits.

joshuaellinger avatar Mar 20 '20 21:03 joshuaellinger

Hi @placeybordeaux, Also check out https://github.com/lazd/coronadatascraper/ - some county scraping is also happening with those folks.

julia326 avatar Mar 21 '20 01:03 julia326