Tathagata Chakraborti
Tathagata Chakraborti
Some states like Karnataka have images embedded inside the PDF -- will require an OCR model on top of the standard PDF extraction module. ## Proposal + [ ] [Amazon...
The file extractor works with PDFs (including those that have images inside) only right? If that's that case, then some states like #116 and #98 would require direct processing of...
Some states e.g. Meghalaya and Manipur publish multiple bulletins per day. We need an extension to the bulletin downloader to either download and concatenate a list of files or just...
Seems like for certain cases all the bulletins are not available going back in time, but the links are derivable. Andaman and Nicobar, for example, has PDFs namde as `INT.pdf`...
We need some sort of ping when bulletin schemas change. Since we already have CI/CD set up, all we need are some checks and balances on the extraction script that...
I think we should be able to skip [Step 3](https://github.com/IBM/covid19-india-data/tree/main/frontend#adding-a-new-page) while setting up the frontend -- should be automatic based on the list of states. Imports and routes should be...
Users should be allowed to make queries to the visualizer in natural language. Example technique (but not limited to) for natural language to SQL conversion: https://towardsdatascience.com/natural-language-to-sql-use-it-on-your-own-database-d4cd5784d081.
``` /opt/anaconda3/envs/nl2ltl/lib/python3.8/site-packages/future/standard_library/__init__.py:65: DeprecationWarning: the imp module is deprecated in favour of importlib; see the module's documentation for alternative uses import imp /opt/anaconda3/envs/nl2ltl/lib/python3.8/site-packages/keras_preprocessing/image/utils.py:23: DeprecationWarning: NEAREST is deprecated and will be removed...