covid19-india-data
covid19-india-data copied to clipboard
Publicly available structured COVID-19 data from India, extracted automatically from daily health bulletins published by state governments.
**State Name:** Meghalaya **Bulletin link:** [Link](https://meghalaya.gov.in/announce/announcements) ### Defaults + [ ] Data extraction done | [HowTo](https://github.com/IBM/covid19-india-data/wiki/Adding-a-new-state-to-the-data-extraction-pipeline) + [ ] Added entry into landing page | [HowTo](https://github.com/IBM/covid19-india-data/tree/main/frontend#adding-a-new-page) + [ ] Add...
**State Name:** Rajasthan **Bulletin link:** [Link](http://rajswasthya.nic.in/) | [Link](https://twitter.com/dineshkumawat) ### Defaults + [ ] Data extraction done | [HowTo](https://github.com/IBM/covid19-india-data/wiki/Adding-a-new-state-to-the-data-extraction-pipeline) + [ ] Added entry into landing page | [HowTo](https://github.com/IBM/covid19-india-data/tree/main/frontend#adding-a-new-page) + [...
Some states like Karnataka have images embedded inside the PDF -- will require an OCR model on top of the standard PDF extraction module. ## Proposal + [ ] [Amazon...
- [ ] Verify MP information extraction -- It currently does not work across dates - [ ] Extract summary information provided in Hindi - [ ] Add MP to...
The file extractor works with PDFs (including those that have images inside) only right? If that's that case, then some states like #116 and #98 would require direct processing of...
Some states e.g. Meghalaya and Manipur publish multiple bulletins per day. We need an extension to the bulletin downloader to either download and concatenate a list of files or just...
Seems like for certain cases all the bulletins are not available going back in time, but the links are derivable. Andaman and Nicobar, for example, has PDFs namde as `INT.pdf`...
We need some sort of ping when bulletin schemas change. Since we already have CI/CD set up, all we need are some checks and balances on the extraction script that...
**State Name:** Madhya Pradesh (MP) **Bulletin link:** [Link](http://sarthak.nhmmp.gov.in/covid/health-bulletin/) ### Defaults + [x] Data extraction done | [HowTo](https://github.com/IBM/covid19-india-data/wiki/Adding-a-new-state-to-the-data-extraction-pipeline) + [ ] Added entry into landing page | [HowTo](https://github.com/IBM/covid19-india-data/tree/main/frontend#adding-a-new-page) + [ ]...